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PREFACE 


This book contains a first-year graduate course in which the basic techniques and 
theorems of analysis are presented in such a way that the intimate connections 
between its various branches are strongly emphasized. The traditionally separate 
subjects of “real analysis” and “complex analysis” are thus united; some of the 
basic ideas from functional analysis are also included. 

Here are some examples of the way in which these connections are demon- 
strated and exploited. The Riesz representation theorem and the Hahn-Banach 
theorem allow one to “ guess ” the Poisson integral formula. They team up in the 
proof of Runge’s theorem. They combine with Blaschke’s theorem on the zeros of 
bounded holomorphic functions to give a proof of the Miintz-Szasz theorem, 
which concerns approximation on an interval. The fact that L 2 is a Hilbert space 
is used in the proof of the Radon-Nikodym theorem, which leads to the theorem 
about differentiation of indefinite integrals, which in turn yields the existence of 
radial limits of bounded harmonic functions. The theorems of Plancherel and 
Cauchy combined give a theorem of Paley and Wiener which, in turn, is used in 
the Denjoy-Carleman theorem about infinitely differentiable functions on the real 
line. The maximum modulus theorem gives information about linear transform- 
ations on If -spaces. 

Since most of the results presented here are quite classical (the novelty lies in 
the arrangement, and some of the proofs are new), I have not attempted to docu- 
ment the source of every item. References are gathered at the end, in Notes and 
Comments. They are not always to the original sources, but more often to more 
recent works where further references can be found. In no case does the absence 
of a reference imply any claim to originality on my part. 

The prerequisite for this book is a good course in advanced calculus 
(settheoretic manipulations, metric spaces, uniform continuity, and uniform 
convergence). The first seven chapters of my earlier book “ Principles of Mathe- 
matical Analysis ” furnish sufficient preparation. 


xiii 




Xiv PREFACE 


Experience with the first edition shows that first-year graduate students can 
study the first 15 chapters in two semesters, plus some topics from 1 or 2 of the 
remaining 5. These latter are quite independent of each other. The first 15 should 
be taken up in the order in which they are presented, except for Chapter 9, which 
can be postponed. 

The most important difference between this third edition and the previous 
ones is the entirely new chapter on differentiation. The basic facts about differen- 
tiation are now derived from the existence of Lebesgue points, which in turn is an 
easy consequence of the so-called “ weak type ” inequality that is satisfied by the 
maximal functions of measures on euclidean spaces. This approach yields strong 
theorems with minimal effort. Even more important is that it familiarizes stu- 
dents with maximal functions, since these have become increasingly useful in 
several areas of analysis. 

One of these is the study of the boundary behavior of Poisson integrals. A 
related one concerns H p -spaces. Accordingly, large parts of Chapters 11 and 17 
were rewritten and, I hope, simplified in the process. 

I have also made several smaller changes in order to improve certain details: 
For example, parts of Chapter 4 have been simplified; the notions of equi- 
continuity and weak convergence are presented in more detail; the boundary 
behavior of conformal maps is studied by means of Lindelof’s theorem about 
asymptotic valued of bounded holomorphic functions in a disc. 

Over the last 20 years, numerous students and colleagues have offered com- 
ments and criticisms concerning the content of this book. I sincerely appreciated 
all of these, and have tried to follow some of them. As regards the present edition, 
my thanks go to Richard Rochberg for some useful last-minute suggestions, and I 
especially thank Robert Burckel for the meticulous care with which he examined 
the entire manuscript. 


Walter Rudin 



PROLOGUE 
THE EXPONENTIAL FUNCTION 


This is the most important function in mathematics. It is defined, for every com- 
plex number z, by the formula 

exp (z)= Z (!) 

n = 0 nl 

The series (1) converges absolutely, for every z and converges uniformly on every 
bounded subset of the complex plane. Thus exp is a continuous function. The 
absolute convergence of (1) shows that the computation 


cc k cc um 0° i n f oo 


(a -f b) n 
n\ 


is correct. It gives the important addition formula 


exp (a) exp ( b ) = exp (a + b ), 


(2) 


valid for all complex numbers a and b. 

We define the number e to be exp (1), and shall usually replace exp (z) by the 
customary shorter expression e z . Note that e° = exp (0) = 1, by (1). 


Theorem 


(a) For every complex z we have e z ^ 0. 

(b) exp is its own derivative: exp' (z) = exp (z). 
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(c) The restriction of exp to the real axis is a monotonically increasing positive 
function , and 

e*— ► oo as x—> oo, e*— ► 0 as x— ► — oo. 

(d) There exists a positive number n such that e Kl/2 = i and such that e z — 1 if 
and only if z/(2ni) is an integer. 

(e) exp is a periodic function , with period Ini. 

(/) The mapping t— > e lt maps the real axis onto the unit circle. 

(g) If w is a complex number and w ^ 0, then w = e z for some z. 


Proof By (2), e z • e z = e z z — e° = 1. This implies (a). Next, 


exp' (z) 


lim 

h-0 


exp (z + h) — exp (z) 


exp (z) lim 

h-+ 0 


exp (h) — 1 


exp (z). 


The first of the above equalities is a matter of definition, the second follows 
from (2), and the third from (1), and ( b ) is proved. 

That exp is monotonically increasing on the positive real axis, and that 
e*— ► oo as x— ► oo, is clear from (1). The other assertions of (c) are conse- 
quences of e x • e~ x = 1. 

For any real number t, (1) shows that e~ ix is the complex conjugate of e ix . 
Thus 


| git 1 2 — git . ~gfi __ git . g~it — git ~ it __ gO _ j 


or 

|^| = 1 (t real). (3) 

In other words, if t is real, e ix lies on the unit circle. We define cos t , sin t to 
be the real and imaginary parts of e ix : 

cos t = Re |V'], sin t = Im \_e ix ~\ ( t real). (4) 

If we differentiate both sides of Euler’s identity 

e lt = cos t + i sin t, (5) 

which is equivalent to (4), and if we apply (h), we obtain 

cos' t + i sin' t = ie ix — — sin t + i cos t, 

so that 


cos = — sin, sin 
The power series (1) yields the representation 


cos. 


, r r r 
COSf = 1 - 2 ! + ^- 6 ! + -“- 


( 6 ) 


( 7 ) 
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Take t = 2. The terms of the series (7) then decrease in absolute value (except 
for the first one) and their signs alternate. Hence cos 2 is less than the sum of 
the first three terms of (7), with t = 2; thus cos 2 < — Since cos 0=1 and 
cos is a continuous real function on the real axis, we conclude that there is a 
smallest positive number t 0 for which cos t 0 = 0. We define 

n = 2t 0 . ( 8 ) 

It follows from (3) and (5) that sin t 0 = ±1. Since 

sin' ( t ) = cos t > 0 


on the segment (0, t 0 ) and since sin 0 = 0, we have sin t 0 > 0, hence sin t 0 = 
1, and therefore 

e ni/2 = i. (9) 


It follows that e Kl = i 2 = — 1, e 2nl = (— l) 2 = 1, and then e 2nin = 1 for 
every integer n. Also, ( e ) follows immediately : 


n z + 2ni 


: <? e 2ni = e 2 . 


(10) 


If z = x + iy, x and y real, then e z = e x e iy \ hence | e z \ — e x . If e z = 1, we there- 
fore must have e x = 1, so that x = 0; to prove that y/2n must be an integer, it 
is enough to show that e iy ^ 1 if 0 < y < 2 k , by (10). 

Suppose 0 < y < 2 k, and 

e iyl 4 = u + iv ( u and v real). (11) 


Since 0 < y/ 4 < tc/ 2, we have u > 0 and v > 0. Also 

e iy = (u + iv)* = u* — 6u 2 v 2 + v* -f 4 iuv(u 2 — v 2 ). (12) 

The right side of (12) is real only if u 2 = v 2 ; since u 2 + v 2 = 1, this happens 
only when u 2 = v 2 = -j, and then (12) shows that 

= - 1 ^ 1 . 


This completes the proof of ( d ). 

We already know that t^> e ix maps the real axis into the unit circle. To 
prove (/), fix w so that | w | = 1 ; we shall show that w = e lt for some real t. 
Write w = u 4* iv, u and v real, and suppose first that u > 0 and v > 0. Since 
u < 1, the definition of k shows that there exists a t, 0 < t < tc/2, such that 
cos t = u\ then sin 2 t = 1 — u 2 = v 2 , and since sin t > 0 if 0 < t < tc/2, we 
have sin t — v. Thus w = e lt . 

If u < 0 and v > 0, the preceding conditions are satisfied by — iw. Hence 
— iw = e ix for some real t, and w = e i{x+ni2 \ Finally, if v < 0, the preceding 
two cases show that —w = e lt for some real t, hence w = e l{x * n \ This com- 
pletes the proof of (/). 

If w 7^ 0, put ct = w/\w\. Then w = | w | a. By (c), there is a real x such 
that | w\ = e x . Since |a| = 1, (/) shows that a = e iy for some real y. Hence 
w = e x+iy . This proves (g) and completes the theorem. / III 
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We shall encounter the integral of (1 + x 2 ) -1 over the real line. To evaluate 
it, put cp(t) = sin t / cos t in ( — rc/2, k/2). By (6), cp' = 1 + (p 2 . Hence cp is a mono- 
tonically increasing mapping of ( — tt/2, tz/2) onto ( — oo, oo), and we obtain 

f 00 dx C n/2 cp\t) dt C nl2 

J-oo i + ^ 2 ”J -, /2 i + <p 2 W "L /2 



CHAPTER 

ONE 

ABSTRACT INTEGRATION 


Toward the end of the nineteenth century it became clear to many mathemati- 
cians that the Riemann integral (about which one learns in calculus courses) 
should be replaced by some other type of integral, more general and more flex- 
ible, better suited for dealing with limit processes. Among the attempts made in 
this direction, the most notable ones were due to Jordan, Borel, W. H. Young, 
and Lebesgue. It was Lebesgue’s construction which turned out to be the most 
successful. 

In brief outline, here is the main idea: The Riemann integral of a function / 
over an interval [a, b] can be approximated by sums of the form 

imntE,) 

i — 1 

where E l9 ..., E n are disjoint intervals whose union is [a, b], m(£,) denotes the 
length of and r, e E x for i = 1, ..., n. Lebesgue discovered that a completely 
satisfactory theory of integration results if the sets E x in the above sum are 
allowed to belong to a larger class of subsets of the line, the so-called 
“ measurable sets,” and if the class of functions under consideration is enlarged to 
what he called “measurable functions.” The crucial set-theoretic properties 
involved are the following: The union and the intersection of any countable 
family of measurable sets are measurable; so is the complement of every measur- 
able set; and, most important, the notion of “length” (now called “measure”) 
can be extended to them in such a way that 

m(E 1 u E 2 u £ 3 u • • •) = m(£ 1 ) 4 - m(E 2 ) 4- m(E 3 ) + • • • 


5 
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for every countable collection {£ f } of pairwise disjoint measurable sets. This pro- 
perty of m is called countable additivity . 

The passage from Riemann’s theory of integration to that of Lebesque is a 
process of completion (in a sense which will appear more precisely later). It is of 
the same fundamental importance in analysis as is the construction of the real 
number system from the rationals. 

The above-mentioned measure m is of course intimately related to the 
geometry of the real line. In this chapter we shall present an abstract (axiomatic) 
version of the Lebesgue integral, relative to any countably additive measure on 
any set. (The precise definitions follow.) This abstract theory is not in any way 
more difficult than the special case of the real line; it shows that a large part of 
integration theory is independent of any geometry (or topology) of the underlying 
space; and, of course, it gives us a tool of much wider applicability. The existence 
of a large class of measures, among them that of Lebesgue, will be established in 
Chap. 2. 

Set-Theoretic Notations and Terminology 

1.1 Some sets can be described by listing their members. Thus {x u ..., x n } is the 
set whose members are x 1? ..., x n ; and {x} is the set whose only member is x. 
More often, sets are described by properties. We write 

{x:P} 

for the set of all elements x which have the property P. The symbol 0 denotes 
the empty set. The words collection, family, and class will be used synonymously 
with set. 

We write x e A if x is a member of the set A; otherwise x $ A. If B is a subset 
of A, i.e., if x e B implies x e A, we write* B <= A. If B c A and A <= B, then 
A = B. If B a A and A # B, B is a proper subset of A. Note that 0 c= A for every 
set A. 

/4uB and A n B are the union and intersection of A and B, respectively. If 
{A a } is a collection of sets, where a runs through some index set I, we write 

U^o, and f ) A * 

a el a el 

for the union and intersection of { A J : 

(J A a = {x: x e A a for at least one a e /} 

a e / 

A a = {x: x e A a for every a e /}. 

a e / 

If / is the set of all positive integers, the customary notations are 

(U and rW 

n — 1 n= 1 
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If no two members of {A a } have an element in common, then {A a } is a 
disjoint collection of sets. 

We write A — B = {x: x e A, x 4 B}> and denote the complement of A by A c 
whenever it is clear from the context with respect to which larger set the com- 
plement is taken. 

The cartesian product A Y x • x A n of the sets A lf . .., A n is the set of all 
ordered n-tuples (a l9 . . . , a n ) where a t e A t for i = 1, . . . , n. 

The real line (or real number system) is R l , and 

R k = R 1 x • x R 1 (k factors). 

The extended real number system is R 1 with two symbols, oo and — oo, adjoined, 
and with the obvious ordering. If — oo < a < b < oo, the interval [a, b] and the 
segment (a, b) are defined to be 

[a, h] = {x: a < x < b} 9 (a, b) = {x: a < x < b}. 

We also write 

[a, b) = {x: a < x < b }, (a, b] = {x: a < x < b}. 

If E a [— oo, oo ] and E ^ 0, the least upper bound (supremum) and great- 
est lower bound (infimum) of E exist in [— oo, oo] and are denoted by sup E and 
inf E. 

Sometimes (but only when sup E e E) we write max E for sup E. 

The symbol 

/:*-> Y 

means that / is a function (or mapping or transformation) of the set X into the set 
Y ; i.e., / assigns to each x e X an element f(x) e Y. If A a X and B c Y, the 
image of A and the inverse image (or pre-image) of B are 

f(A) = {y: y =/(x) for some x e A}, 

f~\B) = {x:/(x) e B}. 

Note that f~ i (B) may be empty even when B # 0. 

The domain of /is X. The range of /is f{X). 

If f(X) = Y,f is said to map A' onto Y. 

We write f~ 1 (y), instead of / -1 ({y}), for every y e 7. If f~ 1 (y) consists of at 
most one point, for each ye Y, /is said to be one-to-one. If /is one-to-one, then 
f~ l is a function with domain /(X) and range 

If/: ► [— oo, oo] and £ c= it is customary to write sup xeE f(x) rather 

than sup /(£). 

lif :X^>Y and g : 7 — ► Z, the composite function g ° /: — ► Z is defined by 

the formula 


(gof)(x) = g(f(x)) (xeX). 
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If the range of / lies in the real line (or in the complex plane), then /is said to 
be a real function (or a complex function). For a complex function f the statement 
“/ >0” means that all values f(x) of / are nonnegative real numbers. 

The Concept of Measurability 

The class of measurable functions plays a fundamental role in integration theory. 
It has some basic properties in common with another most important class of 
functions, namely, the continuous ones. It is helpful to keep these similarities in 
mind. Our presentation is therefore organized in such a way that the analogies 
between the concepts topological space , open set , and continuous function , on the 
one hand, and measurable space , measurable set , and measurable function , on the 
other, are strongly emphasized. It seems that the relations between these concepts 
emerge most clearly when the setting is quite abstract, and this (rather than a 
desire for mere generality) motivates our approach to the subject. 

1.2 Definition 

(a) A collection t of subsets of a set X is said to be a topology in X if t has 
the following three properties : 

(i) 0 g t and X e x. 

(ii) If V ( g x for i = 1, . . . , n, then V l n V 2 n ■ ■ ■ n V„ e x. 

(iii) If { V a } is an arbitrary collection of members of x (finite, countable, or 
uncountable), then (J a V a e x. 

(b) If x is a topology in X , then X is called a topological space , and the 
members of x are called the open sets in X. 

(c) If X and Y are topological spaces and if / is a mapping of X into Y, then 
/is said to be continuous provided that / -1 (F) is an open set in X for 
every open set V in Y. 

1.3 Definition 

(a) A collection 501 of subsets of a set X is said to be a o-algebra in X if 501 
has the following properties : 

(i) X e 

(ii) If A e 001, then A c e 501, where A c is the complement of A relative to 
X. 

(iii) If ,4 = (J® =1 A n and if e 50t for n = 1, 2, 3, . . . , then A e 

(b) If 001 is a a-algebra in X , then X is called a measurable space , and the 
members of 001 are called the measurable sets in X. 

( c ) If AT is a measurable space, Y is a topological space, and / is a mapping 
of X into 7, then / is said to be measurable provided that / _1 (F) is a 
measurable set in X for every open set V in Y. 
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It would perhaps be more satisfactory to apply the term “ measurable space ” 
to the ordered pair (AT, 901), rather than to X. After all, X is a set, and X has not 
been changed in any way by the fact that we now also have a a-algebra of its 
subsets in mind. Similarly, a topological space is an ordered pair ( X , t) 7 But if this 
sort of thing were systematically done in all mathematics, the terminology would 
become awfully cumbersome. We shall discuss this again at somewhat greater 
length in Sec. 1.21. 

1.4 Comments on Definition 1.2 The most familiar topological spaces are the 
metric spaces. We shall assume some familiarity with metric spaces but shall give 
the basic definitions, for the sake of completeness. 

A metric space is a set X in which a distance function (or metric) p is defined, 
with the following properties : 

(a) 0 < p(x, y) < oo for all x and y e X. 

(b) p(x, y) = 0 if and only if x = y. 

(c) p(x, y) = p(y, x) for all x and y g X. 

(< d ) p(x, y) < p(x, z) 4- p(z, y) for all x, y, and z e X. 

Property ( d ) is called the triangle inequality. 

If x e X and r > 0, the open ball with center at x and radius r is the set 
{yeX\p{x, y) < r}. 

If X is a metric space and if t is the collection of all sets E c= X which are 

arbitrary unions of open balls, then t is a topology in X. This is not hard to 

verify; the intersection property depends on the fact that if x e B x n B 2 , where 
B l and B 2 are open balls, then x is the center of an open ball Bc^ n B 2 . We 
leave this as an exercise. 

For instance, in the real line R 1 a set is open if and only if it is a union of 
open segments (a, b). In the plane R 2 , the open sets are those which are unions of 
open circular discs. 

Another topological space, which we shall encounter frequently, is the 
extended real line [— oo, oo]; its topology is defined by declaring the following 
sets to be open: (a, h), [ — oo, a), (a, oo], and any union of segments of this type. 

The definition of continuity given in Sec. 1.2(c) is a global one. Frequently it 
is desirable to define continuity locally: A mapping/ of X into Y is said to be 
continuous at the point x 0 g X if to every neighborhood V of /(x 0 ) there corre- 
sponds a neighborhood W of x 0 such that f(W) a V. 

(A neighborhood of a point x is, by definition, an open set which contains x.) 
When X and Y are metric spaces, this local definition is of course the same 
as the usual epsilon-delta definition, and is equivalent to the requirement that 
lim f(x n ) = f(x 0 ) in Y whenever lim x„ = x 0 in X. 

The following easy proposition relates the local and global definitions of con- 
tinuity in the expected manner : 

1.5 Proposition Let X and Y be topological spaces. A mapping f of X into Y is 
continuous if and only if f is continuous at every point of X. 
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Proof If / is continuous and x 0 e X , then / _1 (F) is a neighborhood of x 0 , 
for every neighborhood V of f(x 0 ). Since /(/ _1 (K)) c= V, it follows that /is 
continuous at x 0 . 

If / is continuous at every point of X and if V is open in 7, every point 
x e f~ l {V) has a neighborhood W x such that f(W x ) cz V. Therefore W x c= 
/ _1 (F). It follows that f~ l {V) is the union of the open sets W x , sof~ 1 (V) is 
itself open. Thus / is continuous. //// 

1.6 Comments on Definition 1.3 Let be a a-algebra in a set X. Referring to 
Properties (i) to (iii) of Definition 1.3(a), we immediately derive the following 
facts. 

(a) Since 0 = X c , (i) and (ii) imply that 0 e 

(b) Taking A n + 1 = 4 n + 2 = ■ • • = 0 in (iii), we see that A x u u • • • u A n 
e ffll if A { for i = 1, . . . , n. 

(c) Since 

OO / 00 \c 

n^= ( im > 

n — l \n = 1 / 

is closed under the formation of countable (and also finite) intersec- 
tions. 

(d) Since A — B = B c n 4, we have >1 — Be5Rif/le5R and B e 

The prefix a refers to the fact that (iii) is required to hold for all countable 
unions of members of If (iii) is required for finite unions only, then is called 
an algebra of sets. 

1.7 Theorem Let Y and Z be topological spaces , and let g: Y — ► Z be contin- 
uous. 

(a) If X is a topological space , if f: X— ► Y is continuous , and if h = g °f then 
h: X — ► Z is continuous. 

(b) If X is a measurable space , if f: X — ► Y is measurable , and ifh = g°f then 
h: X-+Z is measurable. 

Stated informally, continuous functions of continuous functions are contin- 
uous; continuous functions of measurable functions are measurable. 

Proof If V is open in Z, then g~ A (F) is open in 7, and 

k-\V)=f-\g-\V)). 


If /is continuous, it follows that h 1 (V) is open, proving (a). 

If /is measurable, it follows that h~ l (V) is measurable, proving ( b ). //// 
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1.8 Theorem Let u and v be real measurable functions on a measurable space 
X , let O be a continuous mapping of the plane into a topological space Y , and 
define 


h(x) = 0(m(x), v(x)) 

for x e X. Then h: X — ► Y is measurable. 

Proof Put f (x) = (m(x), v(x)). Then / maps X into the plane. Since h = O o f 
Theorem 1.7 shows that it is enough to prove the measurability of /. 

If R is any open rectangle in, the plane, with sides parallel to the axes, 
then R is the cartesian product of two segments and I 2 , and 

f- l (R) = u- l (I 1 )nv~ l (I 2 ) 9 

which is measurable, by our assumption on u and v. Every open set V in the 
plane is a countable union of such rectangles R i9 and since 

f~ i (V)=f- i ( U/ -1 (Ki), 

\/= 1 / i= 1 

f~\V) is measurable. //// 

1.9 Let X be a measurable space. The following propositions are corollaries of 
Theorems 1.7 and 1.8: 

(a) Iff=u + iv , where u and v are real measurable functions on X, then f is a 
complex measurable function on X. 

This follows from Theorem 1.8, with <I>(z) = z. 

(b) Iff = u + iv is a complex measurable function on X , then u, v, and \f\ are real 
measurable functions on X. 

This follows from Theorem 1.7, with g(z) = Re (z), Im (z), and | z | . 

(c) Iff and g are complex measurable functions on X, then so are f + g and fg. 

For real / and g this follows from Theorem 1.8, with 

0(s, t) = s + t 

and 0(s, t) = st. The complex case then follows from (a) and ( b ). 

(d) If E is a measurable set in X and if 

. . f 1 if x e E 

Ux) ~ |o if me 

then Xe is a measurable function. 

This is obvious. We call Xe the characteristic function of the set E. The 
letter x will be reserved for characteristic functions throughout this book. 

( e ) If f is a complex measurable function on X , there is a complex measurable 
function oc on X such that | a | = 1 andf = a | f\ . 
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Proof Let E = {x:f(x) = 0}, let Y be the complex plane with the origin 
removed, define cp(z) = z/ 1 z | for z e 7, and put 

= <p(f(x) + Xe(*)) (* e X). 

If x e E, a(x) = 1; if x $ E, a(x) =/(x)/|/(x)| . Since q> is continuous on Y 
and since E is measurable (why?), the measurability of a follows from (c), ( d ), 
and Theorem 1.7. //// 


We now show that a-algebras exist in great profusion. 

1.10 Theorem If $F is any collection of subsets of X , there exists a smallest 
a-algebra 901* in X such that $F c= 901*. 


This 901* is sometimes called the a-algebra generated by 3F . 

Proof Let Q be the family of all a-algebras 901 in X which contain & . Since 
the collection of all subsets of X is such a a-algebra, Q is not empty. Let 901* 
be the intersection of all 90? e Q. It is clear that 3F c= 901* and that 901* lies in 
every a-algebra in X which contains & . To complete the proof, we have to 
show that 901* is itself a a-algebra. 

If A n e 901* for n— 1, 2, 3, ... , and if 901 e Q, then A n e 901, so (J A n e 901, 
since 901 is a a-algebra. Since (J A n e 901 for every 901 e Q, we conclude that 
[j A n e 901*. The other two defining properties of a a-algebra are verified in 
the same manner. //// 

1.11 Bore! Sets Let X be a topological space. By Theorem 1.10, there exists a 
smallest a-algebra 38 in X such that every open set in X belongs to The 
members of £8 are called the Borel sets of X. 

In particular, closed sets are Borel sets (being, by definition, the complements 
of open sets), and so are all countable unions of closed sets and all countable 
intersections of open sets. These last two are called Ff s and G/s, respectively, 
and play a considerable role. The notation is due to Hausdorff. The letters F and 
G were used for closed and open sets, respectively, and a refers to union ( Summe ), 
3 to intersection (Durchschnitt). For example, every half-open interval [a, b) is a 
G s and an F a in JR 1 . 

Since 0$ is a a-algebra, we may now regard X as a measurable space, with the 
Borel sets playing the role of the measurable sets; more concisely, we consider the 
measurable space ( X , ^). If /: X — ► Y is a continuous mapping of X , where Y is 
any topological space, then it is evident from the definitions that / _1 (F) e 08 for 
every open set V in Y. In other words, every continuous mapping of X is Borel 
measurable. 

Borel measurable mappings are often called Borel mappings , or Borel func- 
tions. 
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1.12 Theorem Suppose 901 is a a -algebra in X 9 and Y is a topological space. 
Let f map X into Y . 

(a) IfQ is the collection of all sets E a Y such that f~ 1 (E) e 901, then Q is a 
o-algebra in Y. 

(b) Iff is measurable and E is a Borel set in Y , then f~ 1 (E) e 901. 

(c) // Y = [— oo, oo] and / -1 (( a, oo]) e 901 for every real a, then f is measur- 
able. 

(d) If f is measurable , if Z is a topological space , if g: Y — > Z is a Borel 
mapping , and ifh — gof then h: X — > Z is measurable. 

Part (c) is a frequently used criterion for the measurability of real-valued 
functions. (See also Exercise 3.) Note that (d) generalizes Theorem 1.7(h). 

Proof (a) follows from the relations 

f~\Y) = X, 

f~ l (Y-A) = X-f-\A), 

and ^ A 2 v ■)=f~ 1 (A l ) u/~ 1 (A 2 ) u •••. 

To prove (h), let Q be as in (a); the measurability of / implies that Q 
contains all open sets in T, and since Q is a cr-algebra, Q contains all Borel 
sets in Y. 

To prove (c), let Q be the collection of all E c [— oo, oo] such that 
f~ 1 {E) e 90t. Choose a real number a, and choose oc n < a so that a „— > a as 
n— » oo. Since (a„, oo] e Q for each n, since 

00 00 

[-00, a) = (J[-°°> a »] = U ( a » ’ °°] c > 

n = 1 n= 1 

and since (a) shows that Q is a <7-algebra, we see that [— oo, a) e Q. The same 
is then true of 


(a, 0) = [-oo, jft) n (a, oo]. 

Since every open set in [— oo, oo] is a countable union of segments of the 
above types, Q contains every open set. Thus / is measurable. 

To prove (<2), let V c = Z be open. Then g~ l {V) is a Borel set of 7, and 
since 


h-\V)=f-\g~\V)l 


(b) shows that h X (F) e 901. //// 

1.13 Definition Let {a n } be a sequence in [ — oo, oo], and put 


b k = sup {a*, a k+1 , a k+2 , ...} ( k = 1, 2, 3, ...) 


( 1 ) 
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and 


P = inf {b u b 2 ,b 3 , 

We call P the upper limit of { a n }, and write 

p = lim sup a n . 


(2) 


(3) 


The following properties are easily verified: First, b x > b 2 > b 3 > • • so 
that ► P as /c— ► oo; secondly, there is a subsequence {a ni } of {a n } such that 
a n — > P as i— ► oo, and p is the largest number with this property. 

The lower limit is defined analogously: simply interchange sup and inf in 
(1) and (2). Note that 


lim inf a n = — lim sup ( — a n ). 

n~* oo w-*oo 

If {a n } converges, then evidently 

lim sup a n = lim inf a n = lim a n . 


(4) 


(5) 


Suppose {/„} is a sequence of extended-real functions on a set X. Then 
sup f n and lim sup f n are the functions defined on X by 


(sup/.) 


sup f n J(x) = sup (/„(*)), 

lim sup /Ax) = lim sup (f„(x)). 


( 6 ) 

(7) 


If 


f(x) = lim f n (x), (8) 

n-> oo 

the limit being assumed to exist at every x e X, then we call / the pointwise 
limit of the sequence {/„}. 


1.14 Theorem If f n : X-* [— oo, oo] is measurable, for n = 1, 2, 3, ... , and 
g = sup /„, h = lim sup /„, 

n > 1 oo 

t/ien h are measurable. 


Proof # -1 ((a, oo]) = (J® =1 / n -1 ((a, oo]). Hence Theorem 1.12(c) implies that 
g is measurable. The same result holds of course with inf in place of sup, and 
since 


ft = inf 
k£ 1 



it follows that h is measurable. 


Illl 
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Corollaries 

(a) The limit of every pointwise convergent sequence of complex measurable 
functions is measurable. 

(b) Iff and g are measurable ( with range in [—00, 00]), then so are max {f g] 
and min {f g). In particular , this is true of the functions 

f + = max {f 0 } and /" = -min {f 0 }. 

1.15 The above functions f + and f~ are called the positive and negative parts of f 
We have |/| =/ + +/“ and / =/ + — /“, a standard representation of / as a 
difference of two nonnegative functions, with a certain minimality property: 

Proposition Iff = g — h, g > 0, and /i > 0, then f + <g and f~ < h. 

Proof /< g and 0 < g clearly implies max {/, 0} < g. hh 

Simple Functions 

1.16 Definition A complex function s on a measurable space X whose range 
consists of only finitely many points will be called a simple function. Among 
these are the nonnegative simple functions, whose range is a finite subset of 
[0, 00). Note that we explicitly exclude 00 from the values of a simple func- 
tion. 

If a l5 ..., a„ are the distinct values of a simple function s, and if we set 
= { x : s(x) = a*}, then clearly 


S= 

1 = 1 

where Xa ( * s the characteristic function of A i9 as defined in Sec. 1 . 9 (d). 

It is also clear that s is measurable if and only if each of the sets A t is 
measurable. 


1.17 Theorem Let f : X — * [0, 00] be measurable. There exist simple measur- 
able functions s n on X such that 

(a) 0 < Si < s 2 < * * * < f 

(b) s„(x)— ► f(x) as n— ► 00, for every x e X. 


Proof Put d n = 2 n . To each positive integer n and each real number t cor- 
responds a unique integer k = k n (t) that satisfies kd n < t < (k + 1)8 n . Define 


<Pn(t) = 



if 0 ^ t < n 
if n ^ t ^ 00. 


(1) 
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Each (p n is then a Borel function on [0, oo], 

t - S n < <p n (t) <t if 0 < t < n, (2) 

0 < <p x < q> 2 < • < U and <p„(t)— ► t as n— ► oo, for every t e [0, oo]. It 
follows that the functions 

S n =<Pn°f ( 3 ) 

satisfy (a) and (b); they are measurable, by Theorem 1.12(d). //// 

Elementary Properties of Measures 

1.18 Definition 

(a) A positive measure is a function p, defined on a <r-algebra SR, whose range 
is in [0, oo] and which is countably additive . This means that if (Aj is a 
disjoint countable collection of members of SR, then 

/*(Cui) = (i) 

To avoid trivialities, we shall also assume that p(A) < oo for at least one 

AeWl. 

(b) A measure space is a measurable space which has a positive measure 
defined on the <7-algebra of its measurable sets. 

(c) A complex measure is a complex-valued countably additive function 
defined on a o-algebra. 

Note : What we have called a positive measure is frequently just called a 
measure ; we add the word “positive” for emphasis. If p(E) = 0 for every E e 
then p is a positive measure, by our definition. The value oo is admissible for a 
positive measure; but when we talk of a complex measure p , it is understood that 
p(E) is a complex number, for every E e SR. The real measures form a subclass of 
the complex ones, of course. 

1.19 Theorem Let pbe a positive measure on a a-algebra Then 

(a) p(0) = 0. 

(b) p(At u • u A n ) = p(A 1 ) + • • • + p(A n ) if A u . . . , A n are pairwise disjoint 
members of SR. 

(c) A a B implies p(A) < p(B) if A e SR, B e SR. 

(d) p(A n )—> p(A) as n— ► oo if A = (J ® =1 A„, A„ e SR, and 

A^ c= A 2 <= A 3 c= • • •. 

(e) n(A n )^>n(A) as n-KX> if A = f)“ =1 A„, A„ e 2R, 

A i => A 2 => A 3 => ■ ■ ■ , 

and piAJ is finite. 
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As the proof will show, these properties, with the exception of (c), also hold 
for complex measures; ( b ) is called finite additivity; (c) is called monotonicity. 

Proof 

(a) Take A e 501 so that g(A) < oo, and take A Y = A and A 2 = A 3 = • • • = 
0 in 1.18(1). 

(b) Take A„ +1 = A n+1 = ••• = 0 in 1.18(1). 

(c) Since B = A u (B — A) and A n (B — A) = 0, we see that ( b ) implies 
m = ti{A) + n(B - A) > n(A). 

(d) Put B t = A lf and put B n = A„ — A„_ t for n = 2, 3, 4, . . . . Then B„ e 9W, 
B, r\ Bj = 0 if i # j, A„ = B Y u • • • u B„, and X = (J,“ , B, . Hence 

and /x(^) = z my 

i= 1 i = 1 

Now (d) follows, by the definition of the sum of an infinite series. 

( e ) Put C n — A y — A n . Then C x c= C 2 c= C 3 c= • , 

MQ = - M'U, 

— A = 1J C„ , and so ( d) shows that 

ti^i) ~ n(A) = - A) = lim ju(C„) = 

n-*ao n-* oo 

This implies (e). //// 

1.20 Examples The construction of interesting measure spaces requires some 
labor, as we shall see. However, a few simple-minded examples can be given 
immediately: 

(a) For any E a X, where X is any set, define g(E) = oo if E is an infinite set, 
and let g(E) be the number of points in £ if £ is finite. This \i is called the 
counting measure on X. 

(b) Fix x 0 e X , define g(E) =1 if x 0 e £ and g(E) = 0 if x 0 $ E, for any 
£ c= X. This ft may be called the unit mass concentrated at x 0 . 

(c) Let \x be the counting measure on the set {1, 2, 3, . . .}, let A n = {n, n + 1, 
n + 2, ...}. Then Q A n = 0 but ^(AJ = oo for n = 1, 2, 3, .... This 
shows that the hypothesis 

M(A i) < oo 

is not superfluous in Theorem 1.19(e). 

1.21 A Comment on Terminology One frequently sees measure spaces referred to 
as “ ordered triples ” ( X , 501, fi) where X is a set, 501 is a c-algebra in X , and \i is a 
measure defined on 501. Similarly, measurable spaces are “ ordered ppir&” (X, 901). 
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This is logically all right, and often convenient, though somewhat redundant. For 
instance, in ( X , the set X is merely the largest member of so if we know 
we also know X. Similarly, every measure has a a-algebra for its domain, by 
definition, so if we know a measure \i we also know the a-algebra on which \i 
is defined and we know the set X in which is a a-algebra. 

It is therefore perfectly legitimate to use expressions like “Let /i be a 
measure ” or, if we wish to emphasize the a-algebra or the set in question, to say 
“ Let \i be a measure on ” or “ Let \i be a measure on XT 

What is logically rather meaningless but customary (and we shall often 
follow mathematical custom rather than logic) is to say “Let X be a measure 
space”; the emphasis should not be on the set, but on the measure. Of course, 
when this wording is used, it is tacitly understood that there is a measure defined 
on some a-algebra in X and that it is this measure which is really under dis- 
cussion. 

Similarly, a topological space is an ordered pair ( X , t), where x is a topology 
in the set X , and the significant data are contained in t, not in X , but “the 
topological space X ” is what one talks about. 

This sort of tacit convention is used throughout mathematics. Most mathe- 
matical systems are sets with some class of distinguished subsets or some binary 
operations or some relations (which are required to have certain properties), and 
one can list these and then describe the system as an ordered pair, triple, etc., 
depending on what is needed. For instance, the real line may be described as a 
quadruple (R 1 , +, *, <), where + , •, and < satisfy the axioms of a complete 
archimedean ordered field. But it is a safe bet that very few mathematicians think 
of the real field as an ordered quadruple. 

Arithmetic in [0, oo] 

1.22 Throughout integration theory, one inevitably encounters oo. Qne reason is 
that one wants to be able to integrate over sets of infinite measure; after all, the 
real line has infinite length. Another reason is that even if one is primarily inter- 
ested in real-valued functions, the lim sup of a sequence of positive real functions 
or the sum of a sequence of positive real functions may well be oo at some points, 
and much of the elegance of theorems like 1.26 and 1.27 would be lost if one had 
to make some special provisions whenever this occurs. 

Let us define a+oo = oo+a = ooif0<a<oo, and 

f oo if 0 < a < oo 
a-oo = oo*a = <^ 

(0 if a = 0; 

sums and products of real numbers are of course defined in the usual way. 

It may seem strange to define 0 • oo = 0. However, one verifies without diffi- 
culty that with this definition the commutative , associative , and distributive laws 
hold in [0, oo] without any restriction. 
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The cancellation laws have to be treated with some care: a + b = a + c 
implies b — c only when a < oo, and ab = ac implies b — c only when 0 < a < oo. 
Observe that the following useful proposition holds : 

7/0 < a Y < a 2 < " • , 0 < hi <b 2 < a n ^> a, and b n — ► b , t/ien a„ b n — ► ah. 

If we combine this with Theorems 1.17 and 1.14, we see that sums and pro - 
ducts of measurable functions into [0, oo] are measurable. 

Integration of Positive Functions 

In this section, 501 will be a a-algebra in a set X and will be a positive measure 
on 501. 

1.23 Definition If s : X — ► [0, oo) is a measurable simple function, of the form 

S='L'*iXAi> ( 1 ) 

1 = 1 

where a x , ..., a„ are the distinct values of s (compare Definition 1.16), and if 
E e 501, we define 


\sdn=Y J x i n{A i nE). (2) 

JE i=l 

The convention 0 • oo = 0 is used here; it may happen that a t — 0 for some i 
and that p(Ai n E) = oo. 

If/: X — > [0, oo] is measurable, and E e 501, we define 

J f dp = sup s (3) 

E JE 

the supremum being taken over all simple measurable functions s such that 
0 <s<f 

The left member of (3) is called the Lebesgue integral of / over E , with 
respect to the measure p. It is a number in [0, oo]. 

Observe that we apparently have two definitions for / dp if /is simple, 
namely, (2) and (3). However, these assign the same value to the integral, 
since / is, in this case, the largest of the functions s which occur on the right 
of (3). 


1.24 The following propositions are immediate consequences of the definitions. 
The functions and sets occurring in them are assumed to be measurable: 

(a) IfO <f<g , then \ E f dp < \ E g dp. 

(b) If A c = B and f > 0, then \ A f dp < ] B f dp. 
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(c) Iff > 0 and c is a constant , 0 < c < oo, then 

\cfdn = c f fdn : 

JE JE 

(d) Iff(x) = 0 for all x e £, then f E f dpi = 0, even if p(E) = oo. 

(e) If p(E) = 0, then f E f dp = 0, even if f(x) = oo for every x e E. 
(/) Iff > 0, then J £ fdn = j* i E f dp. 


This last result shows that we could have restricted our definition of integra- 
tion to integrals over all of X , without losing any generality. If we wanted to 
integrate over subsets, we could then use (/) as the definition. It is purely a 
matter of taste which definition is preferred. 

One may also remark here that every measurable subset E of a measure 
space X is again a measure space, in a perfectly natural way: The new measur- 
able sets are simply those measurable subsets of X which lie in E , and the 
measure is unchanged, except that its domain is restricted. This shows again that 
as soon as we have integration defined over every measure space, we automati- 
cally have it defined over every measurable subset of every measure space. 

1.25 Proposition Let s and t be nonnegative measurable simple functions on X. 

For E e 901, define 


-!■ 


cp{E) = s dp. 


(i) 


Then (p is a measure on 901. Also 


I 1 


(s + t) dfi = 


s dn + 


I' 


(2) 


(This proposition contains provisional forms of Theorems 1.27 and 1.29.) 


Proof If s is as in Definition 1.23, and if E l9 E 2 , . . . are disjoint members of 
901 whose union is E , the countable additivity of p shows that 


<P(E) = X oitHiAi n E) = X <*; Z v( A i n E r ) 


= 1 r = 1 


= I 


n 


oo 


n E r ) = Z<p(E r ). 

i = 1 r = 1 


Also, cp(0 ) = 0, so that cp is not identically oo. 
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Next, let s be as before, let , j8 m be the distinct values of r, and let 

Bj = {x: t(x) = /?,•}. If Eij = 4, n B j9 then 


(s + t)dn = («i + PjMEij) 


sdfi+ \ t dfi = aifi(Ejj) + fij/^Ey). 


Thus (2) holds with E tj in place of X. Since X is the disjoint union of the sets 
E^ (1 < i < n, 1 < j < m), the first half of our proposition implies that (2) 
holds. 1111 


We now come to the interesting part of the theory. One of its most remark- 
able features is the ease with which it handles limit operations. 

1.26 Lebesgue’s Monotone Convergence Theorem Let {/„} be a sequence of 
measurable functions on X , and suppose that 

(a) 0 < fi(x) < f 2 (x) < • • • < oo for every xgI, 

(b) /„(*)—►/(*) asn^> oo, for every x e X. 

Then f is measurable , and 


f n dp^> f dp as n— > oo. 


Proof Since J/„ < f /„ + 1 , there exists an a e [0, oo] such that 


► a as n— ► oo. 


By Theorem 1.14, /is measurable. Since /„ < / we have J/„ < J/for every n, 
so (1) implies 

a<|/rf/x. (2) 

Let s be any simple measurable function such that 0 < s < f let c be a 
constant, 0 < c < 1, and define 

E n = {x : /„(x) > cs(x)} (n = 1, 2, 3, . . .). (3) 

Each E n is measurable, <= E 2 <=■ E 3 a • , and X = [j E n . To see this 
equality, consider some x e X. If f(x) = 0, then x e E l ; if /(x) > 0, then 
cs(x) < / (x), since c < 1 ; hence x e E n for some n. Also 


f n dp > /„ dp > c \ s dp (n = 1, 2, 3, . . .). 
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Let n— ► oo, applying Proposition 1.25 and Theorem 1.19 (d) to the last inte- 
gral in (4). The result is 


cc> c j* s dfi. 


Since (5) holds for every c < 1, we have 


s i\i 


for every simple measurable s satisfying 0 < s < f so that 


*4 


a ^ J f dfi. 


The theorem follows from (1), (2), and (7). 

1.27 Theorem Iff n : X — > [0, oo] is measurable, for n= 1, 2, 3, . . . , and 

/(*)= £/„(*) (xeX), 

n= 1 


(5) 

( 6 ) 

(7) 

//// 

( 1 ) 


then 



(2) 


Proof First, there are sequences {s$, {s'/} of simple measurable functions 
such that sj— and s/-^/ 2 , as in Theorem 1.17. If s t = s' t + s/, then 
Si — ► /i +/ 2 , and the monotone convergence theorem, combined with Propo- 
sition 1.25, shows that 


f (/i +fi) dp — f /i dii + f / 2 d/L 

i/X %/x 


(3) 


Next, put ^ =/i + ' * ' +/jv The sequence {gf^} converges monotoni- 
cally to f and if we apply induction to (3) we see that 



(4) 


Applying the monotone convergence theorem once more, we obtain (2), and 
the proof is complete. //// 


If we let ii be the counting measure on a countable set, Theorem 1.27 is a 
statement about double series of nonnegative real numbers (which can of course 
be proved by more elementary means): 
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Corollary If > 0 for i and j = 1, 2, 3, ... , then 


Z Z z«u- 

i — 1 j=l j=l i — 1 

1.28 Fa toil’s Lemma If f n : X— ► [0, oo] is measurable, for each positive integer 
n , t/ten 



dp < lim inf 

«-* 00 



( 1 ) 


Strict inequality can occur in (1); see Exercise 8. 
Proof Put 


g k (x) = inf f{x) (k = 1, 2, 3, . . . ; x e X). (2) 

i>k 


Then g k <f k , so that 


I 


9k dfi < 


f k dn (k= 1,2,3,...). 


(3) 


Also, 0 < 0 X < 0 2 < each gu is measurable, by Theorem 1.14, and 
0 fc (x)— ► lim inf f n (x) as fc— ► oo, by Definition 1.13. The monotone convergence 
theorem shows therefore that the left side of (3) tends to the left side of (1), as 
fc— ► oo. Hence (1) follows from (3). //// 


1.29 Theorem Suppose f: X-+ [0, oo] is measurable , and 
<p(E) = j^/ dfi (Eem). 


Then q> is a measure on and 



for every measurable g on X with range in [0, oo]. 


(1) 

(2) 


Proof Let E u E 2 , E 3 , ... be disjoint members of whose union is E. 
Observe that 


Xe /= Z X Ej f (3) 

j= i 

and that 


</>(£) = 


[ 


X E fdn, 


cp(Ej) = 


I 


Xsjfdn- 


(4) 
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It now follows from Theorem 1.27 that 

<p(E) = t <P(Ej)- (5) 

j= 1 

Since cp(0) = 0, (5) proves that cp is a measure. 

Next, (1) shows that (2) holds whenever g = Xe f° r some E e 2R. Hence 
(2) holds for every simple measurable function g , and the general case follows 
from the monotone convergence theorem. //// 

Remark The second assertion of Theorem 1.29 is sometimes written in the 
form 

dcp=fdg . (6) 

We assign no independent meaning to the symbols dcp and dg; (6) merely 
means that (2) holds for every measurable g > 0. 

Theorem 1.29 has a very important converse, the Radon-Nikodym 
theorem, which will be proved in Chap. 6. 


Integration of Complex Functions 

As before, g will in this section be a positive measure on an arbitrary measurable 
space X. 

1.30 Definition We define l}(g) to be the collection of all complex measur- 
able functions / on X for which 


J |/| dg < oo. 

Note that the measurability of / implies that of | / 1 , as we saw in Propo- 
sition 1.9(h); hence the above integral is defined. 

The members of I}(g) are called Lebesgue integrable functions (with 
respect to g) or summable functions. The significance of the exponent 1 will 
become clear in Chap. 3. 

1.31 Definition If/ = u + iv, where u and v are real measurable functions on 
X , and if f e I}(g), we define 




dg 4- i \ v + dg — i 



(1) 


for every measurable set E. 

Here u + and u~ are the positive and negative parts of m, as defined in 
Sec. 1.15; v + and v~ are similarly obtained from v. These four functions are 
measurable, real, and nonnegative; hence the four integrals on the right of (1) 
exist, by Definition 1.23. Furthermore, we have u + <\u \ < |/|, etc., so that 
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each of these four integrals is finite. Thus (1) defines the integral on the left as 
a complex number. 

Occasionally it is desirable to define the integral of a measurable func- 
tion / with range in [ — oo, oo] to be 

[fdn = f/ + dn - [/” dn, (2) 

JE Je Je 

provided that at least one of the integrals on the right of (2) is finite. The left 
side of (2) is then a number in [ — oo, oo]. 


1.32 Theorem Suppose f and g e l}(p) and a and p are complex numbers . Then 
a/+ fig e and 


I (oc/ + Pg)dfi = a f f dg + 

JX JX 


9 dfi. 


(1) 


Proof The measurability of a if 4- fig follows from Proposition 1.9(c). By Sec. 
1.24 and Theorem 1.27, 


I a/ + P9 1 dn 


I l/l + l)3| \g\)dn 


£ ["“ 

= I a I [ I/I dn + \P\ f \g\dn<QC. 
JX Jx 


Thus <xf + fig e l}(p). 

To prove (1), it is clearly sufficient to prove 


and 


[ (/+ 9 )dp= f f dp + f g dp 
= a j* fdfi. 


(2) 


(a/) dn 


( 3 ) 


and the general case of (2) will follow if we prove (2) for real / and g in l}(p). 
Assuming this, and setting h = f 4- g, we have 


or 


h + ~ h~ =f + — f~ + g + — g~ 
4 -/" 4- g~ =/ + 4- g + 4- h~ 


By Theorem 1.27, 


J^ + J/-+J.- = Jr + J^+J*-, 


( 4 ) 


(5) 


and since each of these integrals is finite, we may transpose and obtain (2). 
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That (3) holds if a > 0 follows from Proposition 1.24(c). It is easy to 
verify that (3) holds if a = —1, using relations like ( — u) + = w". The case 
a = i is also easy: If/ = u + it;, then 


(% (% * r r 

(if) = (iw - y) = (-t;) + M w = - i; 


+ i 



— i 




f 


Combining these cases with (2), we obtain (3) for any complex a. //// 


1.33 Theorem ///e L 1 ^), t/icn 



/• 

< 

Jx 


I/I 


Proof Put z = j* / Since z is a complex number, there is a complex 
number a, with | a | = 1, such that az = | z | . Let u be the real part of a/ Then 
u < | a/| = |/| - Hence 



= a 



a/ du = I w tiu < 

Jx Jx 




The third of the above equalities holds since the preceding ones show that 
| a f du is real. I HI 


We conclude this section with another important convergence theorem. 


1.34 Lebesgue’s Dominated Convergence Theorem Suppose {/„} is a sequence 
of complex measurable functions on X such that 

f(x) = lim f„(x) (1) 

n~* oo 

exists for every x e X. If there is a function g e I}(p) such that 

| f n (x) | < g(x) (n = 1, 2, 3, . . . ; x e X), (2) 


then f e l}(p\ 


and 


lim 

n~* oo 


lim 

n~* oo 


[l/.-/l^ = 0. 
f f n dli= f fdn. 

JX JX 


( 3 ) 


( 4 ) 
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Proof Since |/| < g and / is measurable, / e l}(g). Since |/„ — /| < 2g, 
Fatou’s lemma applies to the functions 2g — \ f n — f | and yields 


I 


2g dg < lim inf 


(2g-\f n -f\)dg 


2g dg + lim inf 


-i 


2g dg — lim sup 






Since J 2g dg is finite, we may subtract it and obtain 


lim sup 


L' f -~ 


f\dg<0. 


( 5 ) 


If a sequence of nonnegative real numbers fails to converge to 0, then its 
upper limit is positive. Thus (5) implies (3). By Theorem 1.33, applied to 
/„-/( 3) implies (4). //// 


The Role Played by Sets of Measure Zero 

1.35 Definition Let P be a property which a point x may or may not have. 
For instance, P might be the property “/ (x) > 0” if /is a given function, or it 
might be “ {/„(x)} converges” if {/„} is a given sequence of functions. 

If g is a measure on a a-algebra and if E e 9W, the statement “ P holds 
almost everywhere on £” (abbreviated to “ P holds a.e. on £”) means that 
there exists an N e such that g(N) = 0, iV c £, and P holds at every point 
of E — N. This concept of a.e. depends of course very strongly on the given 
measure, and we shall write “ a.e. [/i] ” whenever clarity requires that the 
measure be indicated. 

For example, if/ and g are measurable functions and if 

H({x:f(x) # g(x)}) = 0, (1) 

we say that / = g a.e. [ g ] on X 9 and we may write / ~ g. This is easily seen to 
be an equivalence relation. The transitivity (/ ~ g and g ~ h implies / ~ h) 
is a consequence of the fact that the union of two sets of measure 0 has 
measure 0. 

Note that if/ ~ g, then, for every E e 

[fdn=\gd(L (2) 

JE JE 

To see this, let N be the set which appears in (1); then E is the union of the 
disjoint sets E — N and E n N; on E — N,f = g , and g(E n N) = 0. 
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Thus, generally speaking, sets of measure 0 are negligible in integration. 
It ought to be true that every subset of a negligible set is negligible. But it 
may happen that some set N e 901 with p(N) = 0 has a subset E which is not 
a member of 901. Of course we can define p(E) = 0 in this case. But will this 
extension of p still be a measure, i.e., will it still be defined on a a-algebra? It 
is a pleasant fact that the answer is affirmative : 

1.36 Theorem Let ( X , 901, p) be a measure space , let 901* be the collection of all 
E c= X for which there exist sets A and Be 901 such that A c= E c= B and 
p(B — A) = 0, and define p(E) = p(A) in this situation. Then 901* is a a-algebra , 
and p is a measure on 901*. 

This extended measure p is called complete , since all subsets of sets of 
measure 0 are now measurable; the a-algebra 901* is called the p-completion of 901. 
The theorem says that every measure can be completed, so, whenever it is conve- 
nient, we may assume that any given measure is complete; this just gives us more 
measurable sets, hence more measurable functions. Most measures that one 
meets in the ordinary course of events are already complete, but there are excep- 
tions; one of these will occur in the proof of Fubini’s theorem in Chap. 8. 

Proof We begin by checking that p is well defined for every E e 901*. 
Suppose A c= E cz B, A t c~ E c= B l9 and p(B — A) = p(B x — Af) = 0. (The 
letters A and B will denote members of 901 throughout this proof.) Since 

A - A l c= E - A x c= B x - A x 

we have p(A — Af) = 0, hence p(A) = p(A n A x ). For the same reason, 
KA i) = n A). We conclude that indeed p{A x ) = p(A). 

Next, let us verify that 901* has the three defining properties of a a- 
algebra. 

(i) X e 901*, because X e 901 and 901 <= 901*. 

(ii) If A c E c B then B c <= E c c= A c . Thus E e 901* implies E c e 901*, because 
A c — B c — A c n B = B — A. 

(iii) If A t c= E t c Bf, E = E i9 A = A i9 B = B £ , then A a E c B and 

B-A = \J(B t -A)cz\J {B t - A,). 

1 1 

Since countable unions of sets of measure zero have measure zero, it follows 

that E e 901* if E ( e 901* for i = 1, 2, 3, 

Finally, if the sets E t are disjoint in step (iii), the same is true of the sets 
A i9 and we conclude that 

H(E) = n(A) = f n(Ad = f n(Ed. 

1 1 

This proves that p is countably additive on 901*. 


Ill / 
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1.37 The fact that functions which are equal a.e. are indistinguishable as far as 
integration is concerned suggests that our definition of measurable function 
might profitably be enlarged. Let us call a function / defined on a set £ g 901 
measurable on X if p(E c ) = 0 and if n £ is measurable for every open set 
V. If we define /(x) = 0 for x g £ c , we obtain a measurable function on X , in the 
old sense. If our measure happens to be complete, we can define / on E c in a 
perfectly arbitrary manner, and we still get a measurable function. The integral of 
/ over any set A g 901 is independent of the definition of / on £ c ; therefore this 
definition need not even be specified at all. 

There are many situations where this occurs naturally. For instance, a func- 
tion / on the real line may be differentiable only almost everywhere (with respect 
to Lebesgue measure), but under certain conditions it is still true that / is the 
integral of its derivative; this will be discussed in Chap. 7. Or a sequence {/„} of 
measurable functions on X may converge only almost everywhere; with our new 
definition of measurability, the limit is still a measurable function on X 9 and we 
do not have to cut down to the set on which convergence actually occurs. 

To illustrate, let us state a corollary of Lebesgue’s dominated convergence 
theorem in a form in which exceptional sets of measure zero are admitted: 

1.38 Theorem Suppose {/„} is a sequence of complex measurable functions 
defined a.e. on X such that 



\f n \dp<co. 


(1) 


Then the series 


f(x) = Z f„(x) (2) 

n = 1 

converges for almost all x,/e l}(p\ and 

f fdn = £ f /„ d/x. (3) 

JX n = 1 JX 

Proof Let S n be the set on which /„ is defined, so that p(S c n ) = 0. Put (p(x ) = 
Z I fn( x ) I , for x g S = P) S n . Then p(S c ) = 0. By (1) and Theorem 1.27, 

/% 

cp dp < oo. (4) 

Js 

If E = {x g S: cp(x) < oo}, it follows from (4) that p(E c ) = 0. The series (2) 
converges absolutely for every x g E, and if /(x) is defined by (2) for x e £, 
then |/(x)| < cp(x) on £, so that f g L\p) on £, by (4). If g n =h + • • • +/„, 
then \g n \ < cp, g n (x)^> f(x) for all x e £, and Theorem 1.34 gives (3) with £ in 
place of X. This is equivalent to (3), since p(E c ) — 0. I HI 
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Note that even if the f n were defined at every point of X, (1) would only imply 
that (2) converges almost everywhere. Here are some other situations in which we 
can draw conclusions only almost everywhere: 

1.39 Theorem 

(a) Suppose f: X — ► [0, oo] is measurable , E e and \ E f dp = 0. Then / = 0 
a.e. on E. 

(b) Suppose f e l}(p) and Je f dp = 0 for every E e Then f = 0 a.e. on X. 

(c) Suppose f e l}(p) and 



-l 


l/l dfi. 


Then there is a constant a such that of = \f\ a.e. on X. 


Note that (c) describes the condition under which equality holds in Theorem 
1.33. 

Proof 

(a) If A n = {x e E: f(x) > l/n}, n- 1, 2, 3, ... , then 

- n{A n ) < ) / dn < I f dn = 0, 

n JA„ JE 

so that p(A n ) = 0. Since {x e E:f(x) > 0} = (J A n , (a) follows. 

(b) Put f — u iv, let E = {x: u(x) > 0}. The real part of J £ / dp is then 
J £ u + dp. Hence J £ u + dp = 0, and (a) implies that u + = 0 a.e. We con- 
clude similarly that 

u~ = v + = v~ = 0 a.e. 

(c) Examine the proof of Theorem 1.33. Our present assumption implies that 

the last inequality in the proof of Theorem 1.33 must actually be an 
equality. Hence J ( I / 1 — u) dp — 0. Since | / 1 — u > 0, (a) shows that 
| / 1 = u a.e. This says that the real part of of is equal to | of | a.e., hence 
a / = : I a / 1 = I / 1 a.e., which is the desired conclusion. //// 

1.40 Theorem Suppose p(X) < oo, / e l}(p), S is a closed set in the complex 
plane , and the averages 




lie in S for every E e with p(E) > 0. Then f(x) e S for almost all x e X. 
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Proof Let A be a closed circular disc (with center at a and radius r > 0, say) 
in the complement of 5. Since S c is the union of countably many such discs, it 
is enough to prove that g(E) = 0, where E — f~ l ( A). 

If we had g(E) > 0, then 

which is impossible, since A^f) e S. Hence n(E) = 0. //// 


f lf ~ 

ME) Je 


a | d\i < r, 


1.41 Theorem Let {E k } be a sequence of measurable sets in X, such that 

Z ME*) < CO- (!) 

*= 1 

Tfren almost all x e X lie in at most finitely many of the sets E k . 


Proof If A is the set of all x which lie in infinitely many E ki we have to 
prove that n(A) = 0. Put 


9{x)= Z X Ek (x) (x e X). (2) 

k=l 

For each x, each term in this series is either 0 or 1. Hence x e A if and only if 
g(x) = oo. By Theorem 1.27, the integral of g over X is equal to the sum in 
(1). Thus g e l}(fi), and so g(x) < oo a.e. //// 


Exercises 

1 Does there exist an infinite <r-algebra which has only countably many members? 

2 Prove an analogue of Theorem 1.8 for n functions. 

3 Prove that if/ is a real function on a measurable space X such that {x: /(x) > r} is measurable for 
every rational r, then /is measurable. 

4 Let {a„} and {b n } be sequences in [ — oo, oo], and prove the following assertions: 

(a) lim sup ( — a n ) = — lim inf a n . 

n~* oo n -» oo 

(i b ) lim sup ( a n + b n ) < lim sup a n + lim sup b H 

w-*ao n~* oo oo 

provided none of the sums is of the form oo — oo. 

(c) If a n <, b n for all n, then 

lim inf a n <, lim inf b n . 


Show by an example that strict inequality can hold in (b). 
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5 (a) Suppose /: X— > [— oo, oo] and g: X— > [— oo, oo] are measurable. Prove that the sets 

{*:/(*) < g(x)}, {*:/(*) = g{x)} 


are measurable. 

(b) Prove that the set of points at which a sequence of measurable real-valued functions con- 
verges (to a finite limit) is measurable. 

6 Let X be an uncountable set, let be the collection of all sets E <= X such that either E or E c is at 
most countable, and define n(E) = 0 in the first case, n(E) = 1 in the second. Prove that is a 
(T-algebra in X and that n is a measure on Describe the corresponding measurable functions and 
their integrals. 

7 Suppose /„: X— ► [0, oo] is measurable for n — 1, 2, 3, ..., f i >f 2 > / 3 > • • • > 0, /„(*)— >f(x) as 
n— ► oo, for every x e X, and f l e l}(n). Prove that then 

lim f /„ d\i = j fdg. 

B-00 JX JX 

and show that this conclusion does not follow if the condition e L 1 ^)” is omitted. 

8 Put/. = X E if w is odd,/„ = 1 — if n is even. What is the relevance of this example to Fatou’s 
lemma? 

9 Suppose n is a positive measure on X,f: X-> [0, oo] is measurable, / dg. = c, where 0 < c < oo, 
and a is a constant. Prove that 


lim 

n~* oo 


1 , 


n log [1 + (//«)“] dn = 


r OO 

c 

0 


if 0 < a < 1, 
if a = 1, 
if 1 < a < oo. 


Hint : If a > 1, the integrands are dominated by a/ If a < 1, Fatou’s lemma can be applied. 

10 Suppose n{X) < oo, {/„} is a sequence of bounded complex measurable functions on X, an d/„— ►/ 
uniformly on X. Prove that 


lim 

n~ oo 



and show that the hypothesis “ n{X) < oo ” cannot be omitted. 
11 Show that 


a = n u £* 

n= 1 k = n 

in Theorem 1.41, and hence prove the theorem without any reference to integration. 

12 Suppose / e l}{n). Prove that to each € > 0 there exists a 3 > 0 such that \ E \f\ dn < e whenever 

m < *• 

13 Show that proposition 1.24 (c) is also true when c = oo. 
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POSITIVE BOREL MEASURES 


Vector Spaces 

2.1 Definition A complex vector space (or a vector space over the complex 
field) is a set V, whose elements are called vectors and in which two oper- 
ations, called addition and scalar multiplication , are defined, with the follow- 
ing familiar algebraic properties : 

To every pair of vectors x and y there corresponds a vector x 4- y, in 
such a way that x + y = y + x and x + (y + z) = (x + y) + z; V contains a 
unique vector 0 (the zero vector or origin of V) such that x -1- 0 = x for every 
x e V; and to each x e V there corresponds a unique vector — x such that 
x -1- ( — x) = 0. 

To each pair (a, x), where x e V and a is a scalar (in this context, the 
word scalar means complex number ), there is associated a vector ax e V, in 
such a way that lx = x, a(fix) = (a/?)x, and such that the two distributive 
laws 


a(x + y) = ocx + ay, (a 4- p)x = ax -1- fix 


(1) 


hold. 

A linear transformation of a vector space V into a vector space is a 
mapping A of V into V t such that 

A(ax -1- Py) = ocAx -1- PAy (2) 

for all x and ye F and for all scalars a and P . In the special case in which 
is the field of scalars (this is the simplest example of a vector space, except for 
the trivial one consisting of 0 alone), A is called a linear functional A linear 
functional is thus a complex function on V which satisfies (2). 

Note that one often writes Ax, rather than A(x), if A is linear. 


33 
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The preceding definitions can of course be made equally well with any field 
whatsoever in place of the complex field. Unless the contrary is explicitly stated, 
however, all vector spaces occurring in this book will be complex, with one 
notable exception: the euclidean spaces R k are vector spaces over the real field. 


2.2 Integration as a Linear Functional Analysis is full of vector spaces and linear 
transformations, and there is an especially close relationship between integration 
on the one hand and linear functionals on the other. 

For instance, Theorem 1.32 shows that l}(p) is a vector space, for any posi- 
tive measure )U, and that the mapping 


/-[/^ (i) 

is a linear functional on l}(p). Similarly, if g is any bounded measurable function, 
the mapping 

/— J ' fy dn (2) 

is a linear functional on l}(p); we shall see in Chap. 6 that the functionals (2) are, 
in a sense, the only interesting ones on l}(p). 

For another example, let C be the set of all continuous complex functions on 
the unit interval / = [0, 1]. The sum of the two continuous functions is contin- 
uous, and so is any scalar multiple of a continuous function. Hence C is a vector 
space, and if 


A / = 


J f(x)dx 


(/e C), 


( 3 ) 


the integral being the ordinary Riemann integral, then A is clearly a linear func- 
tional on C; A has an additional interesting property: it is a positive linear func- 
tional This means that A/ > 0 whenever / > 0. 

One of the tasks which is still ahead of us is the construction of the Lebesgue 
measure. The construction can be based on the linear functional (3), by the fol- 
lowing observation: Consider a segment (a, b) / and consider the class of all 
f e C such that 0 </< 1 on / and f(x) — 0 for all x not in (a, b). We have 
A/ < b — a for all such f but we can choose / so that A f is as close to b — a as 
desired. Thus the length (or measure) of (a, b) is intimately related to the values of 
the functional A. 

The preceding observation, when looked at from a more general point of 
view, leads to a remarkable and extremely important theorem of F. Riesz: 

To every positive linear functional A on C corresponds a finite positive Borel 
measure p on I such that 


/% 

A/= fdn (/e Q. 

Ji 


(4) 
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[The converse is obvious: if \i is a finite positive Borel measure on / and if A is 
defined by (4), then A is a positive linear functional on C.] 

It is clearly of interest to replace the bounded interval I by R 1 . We can do 
this by restricting attention to those continuous functions on R 1 which vanish 
outside some bounded interval. (These functions are Riemann integrable, for 
instance.) Next, functions of several variables occur frequently in analysis. Thus 
we ought to move from R 1 to R". It turns out that the proof of the Riesz theorem 
still goes through, with hardly any changes. Moreover, it turns out that the 
euclidean properties of R n (coordinates, orthogonality, etc.) play no role in the 
proof; in fact, if one thinks of them too much they just get in the way. Essential 
to the proof are certain topological properties of R n . (Naturally. We are now 
dealing with continuous functions.) The crucial property is that of local com - 
pactness : Each point of R n has a neighborhood whose closure is compact. 

We shall therefore establish the Riesz theorem in a very general setting 
(Theorem 2.14). The existence of Lebesgue measure follows then as a special case. 
Those who wish to concentrate on a more concrete situation may skip lightly 
over the following section on topological preliminaries (Urysohn’s lemma is the 
item of greatest interest there; see Exercise 3) and may replace locally compact 
Hausdorff spaces by locally compact metric spaces, or even by euclidean spaces, 
without missing any of the principal ideas. 

It should also be mentioned that there are situations, especially in probability 
theory, where measures occur naturally on spaces without topology, or on topo- 
logical spaces that are not locally compact. An example is the so-called Wiener 
measure which assigns numbers to certain sets of continuous functions and which 
is a basic tool in the study of Brownian motion. These topics will not be dis- 
cussed in this book. 

Topological Preliminaries 

2.3 Definitions Let X be a topological space, as defined in Sec. 1.2. 

(a) A set E a X is closed if its complement E c is open. (Hence 0 and X are 
closed, finite unions of closed sets are closed, and arbitrary intersections 
of closed sets are closed.) 

(b) The closure E of a set E a X is the smallest closed set in X which con- 
tains E. (The following argument proves the existence of E: The collec- 
tion Q of all closed subsets of X which contain E is not empty, since 
X e Q; let E be the intersection of all members of Q.) 

(c) A set K a X is compact if every open cover of K contains a finite sub- 
cover. More explicitly, the requirement is that if { is a collection of 
open sets whose union contains R, then the union of some finite sub- 
collection of { V a } also contains K. 

In particular, if X is itself compact, then X is called a compact space. 

(< d ) A neighborhood of a point p e X is any open subset of X which contains 
p. (The use of this term is not quite standardized; some use 
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“neighborhood of p” for any set which contains an open set containing 

P) 

(e) X is a. Hausdorff space if the following is true: If p e X 9 q e X 9 and p ^ q, 
then p has a neighborhood U and q has a neighborhood V such that 
U n V = 0. 

(/) X is locally compact if every point of X has a neighborhood whose 
closure is compact. 

Obviously, every compact space is locally compact. 

We recall the Heine-Borel theorem: The compact subsets of a euclidean space 
R n are precisely those that are closed and bounded ([26], f Theorem 2.41). From 
this it follows easily that R n is a locally compact Hausdorff space. Also, every 
metric space is a Hausdorff space. 

2.4 Theorem Suppose K is compact and F is closed , in a topological space X. 
If F cz K, then F is compact. 

Proof If {J^} is an open cover of F and W — F c , then W u (J a V a covers X; 
hence there is a finite collection { such that 

K cz W u K ai u ■ • • u V an . 

Then Fez V ai u • • • u K n . Ill I 

Corollary If A cz B and if B has compact closure , so does A. 

2.5 Theorem Suppose X is a Hausdorff space , K cz X, K is compact , and 
p e K c . Then there are open sets U and W such that p e U, K cz W, and 
U n W = 0. 

Proof If q e K, the Hausdorff separation axiom implies the existence of dis- 
joint open sets U q and V q , such that p e U q and q e V q . Since K is compact, 
there are points q l9 ... 9 q n e K such that 

K^V ti u-uF,. 

Our requirements are then satisfied by the sets 

U = U<n n n U qn and W= V qi u ••• u V qn . //// 

Corollaries 

(a) Compact subsets of Hausdorff spaces are closed. 

(b) If F is closed and K is compact in a Hausdorff space , then F n K is 
compact. 

Corollary (b) follows from (a) and Theorem 2.4. 


t Numbers in brackets refer to the Bibliography. 
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2.6 Theorem If { K a } is a collection of compact subsets of a Hausdorff space 
and if f) a K a = 0, then some finite subcollection of { K a } also has empty inter- 
section. 

Proof Put V a = K c a . Fix a member of {K a }. Since no point of K t belongs 
to every K a , {V a } is an open cover of K v Hence a V ai u ••• u V an for 
some finite collection {V a f. This implies that 

K 1 nK. 1 n---nK* = 0. //// 

2.7 Theorem Suppose U is open in a locally compact Hausdorff space X , 
K a U, and K is compact. Then there is an open set V with compact closure 
such that 


KcK c V <=U. 

Proof Since every point of K has a neighborhood with compact closure, 
and since K is covered by the union of finitely many of these neighborhoods, 
K lies in an open set G with compact closure. If U = X 9 take V = G. 

Otherwise, let C be the complement of U. Theorem 2.5 shows that_to 
each p g C there corresponds an open set W p such that K a W p and p $ W p . 
Hence {C n G n W p } 9 where p ranges over C, is a collection of compact sets 
with empty intersection. By Theorem 2.6 there are points p i9 p n g C such 
that 

CnGnW pi n--nW Pn = 0. 

The set 


V = G n W pi n • • • n W Pn 
then has the required properties, since 

V^GnW pi n--- nW Pn . //// 

2.8 Definition Let / be a real (or extended-real) function on a topological 
space. If 

{x\f(x) > a} 

is open for every real a, /is said to be lower semicontinuous. If 

{*:/(*) < «} 

is open for every real a, /is said to be upper semicontinuous. 

A real function is obviously continuous if and only if it is both upper and 
lower semicontinuous. 

The simplest examples of semicontinuity are furnished by characteristic func- 
tions : 
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(a) Characteristic functions of open sets are lower semicontinuous. 

(b) Characteristic functions of closed sets are upper semicontinuous. 

The following property is an almost immediate consequence of the defini- 
tions: 

(c) The supremum of any collection of lower semicontinuous functions is lower 
semicontinuous. The infimum of any collection of upper semicontinuous func- 
tions is upper semicontinuous. 

2.9 Definition The support of a complex function / on a topological space X 
is the closure of the set 


{x:/(x) # 0}. 

The collection of all continuous complex functions on X whose support is 
compact is denoted by C c (X). 

Observe that C c (X) is a vector space. This is due to two facts: 

(a) The support of / + g lies in the union of the support of / and the support 
of g , and any finite union of compact sets is compact. 

(b) The sum of two continuous complex functions is continuous, as are 
scalar multiples of continuous functions. 

(Statement and proof of Theorem 1.8 hold verbatim if “ measurable function” is 
replaced by “continuous function,” “measurable space” by “topological space”; 
take d>(s, t) = s + t, or 0(s, t) = st, to prove that sums and products of contin- 
uous functions are continuous.) 

2.10 Theorem Let X and Y be topological spaces , and let f: X^Y be contin- 
uous. If K is a compact subset of X , then f(K) is compact. 

Proof If {V a } is an open cover of f(K ), then {f~ i {V a )} is an open cover of K , 
hence K <^f~ l (V ai ) u • • • u f~ l {V a ) for some ol u ..., a„, and therefore 
f(K)^ V ai u ••• u V an . " //// 

Corollary The range of any f e C c (X) is a compact subset of the complex 
plane. 

In fact, if K is the support of / e C c (X) 9 then f(X) a f(K) u {0}. If X is not 
compact, then 0 e f(X\ but 0 need not lie in f(K\ as is seen by easy examples. 

2.11 Notation In this chapter the following conventions will be used. The 
notation 


K<f 


( 1 ) 



POSITIVE BOREL MEASURES 39 


will mean that K is a compact subset of X , that / e C c (X), that 0 </(x) < 1 
for all xel, and that /(x) = 1 for all x g K. The notation 

f< V (2) 

will mean that V is open, that / e C c (X\ 0 </< 1, and that the support of / 
lies in V. The notation 


K <f< V (3) 

will be used to indicate that both (1) and (2) hold. 

2.12 Urysohn’s Lemma Suppose X is a locally compact Hausdorff space , V is 
open in X, K c= V 9 and K is compact. Then there exists an f e C c (X\ such that 

K<f<V. (1) 


In terms of characteristic functions, the conclusion asserts the existence of a 
continuous function f which satisfies the inequalities x* </ < . Note that it is 

easy to find semicontinuous functions which do this ; examples are Xk and Xv • 


Proof Put = 0, r 2 = 1, and let r 3 , r 4 , r 5 , ... be an enumeration of the 
rationals in (0, 1). By Theorem 2.7, we can find open sets V 0 and then V t such 
that V 0 is compact and 

K <= Kj <= Pj <= V 0 <= V 0 <= V. (2) 


Suppose n > 2 and V ri , . . . , V rn have been chosen in such a manner that 
r t < Yj implies V r . a V r . . Then one of the numbers r l9 . . . , r n , say r t , will be the 
largest one which is smaller than r n + 1 , and another, say r j9 will be the smal- 
lest one larger than r n + l . Using Theorem 2.7 again, we can find V rn+l so that 


Continuing, we obtain a collection {1^.} of open sets, one for every 


rational r e [0, 1], with 
compact , and 

the following properties: K 

<= V u V 0 <= V, each V r is 


s > r implies V s a V r . 

( 3 ) 

Define 




II 

if x e V r , 
otherwise, 


ifxeP - (4) 

otherwise, 

and 

/= sup f r , 

r 

g = inf g s . 

s 

( 5 ) 


The remarks following Definition 2.8 show that / is lower semi- 
continuous and that g is upper semicontinuous. It is clear that 0 </< 1, that 
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f(x) = 1 if x g K, and that / has its support in V 0 . The proof will be com- 
pleted by showing that / = g. 

The inequality f r (x) > g s (x) is possible only if r > s, x e V r , and x $ V s . 
But r > s implies V r a V s . Hence f r < g s for all r and s, so / < g. 

Suppose / (x) < g(x) for some x. Then there are rationals r and s such 
that f(x)<r<s< g(x). Since /(x) < r, we have x £ F r ; since #(x) > s, we 
have x e V s . By (3), this is a contradiction. Hence / = g. //// 

2.13 Theorem Suppose V lf V„ are open subsets of a locally compact Haus- 
dorjf space X , K is compact , and 

K c V t u ••• u V n . 

Then there exist functions h t -< ^ (i = 1, . . . , n) such that 

Mx) + * • • + M*) =1 (xe K). (1) 

Because of (1), the collection {h l , . h n } is called a partition of unity on K , 
subordinate to the cover { . . . , V n }. 

Proof By Theorem 2.7, each x e K has a neighborhood W x with compact 
closure W x cz V t for some i (depending on x). There are points x u . . . , x m such 
that W XI u • • • u W Xm => K. If 1 < i < n, let H t be the union of those 
W Xj which lie in V t . By Urysohn’s lemma, there are functions g t such that 
Hi < g t < Vi - Define 

hi =g i 

h 2 = (1 - Q i)G 2 

(2) 

K = {\- gi){l - g 2 ) * * • (1 ~ 0n- l)^n • 

Then h t *< K f . It is easily verified, by induction, that 

+ h 2 H- • • * + h n = 1 — (1 — <7i)(l — gf) • * • (1 — g n )- (3) 

Since K cz H x u • u H n , at least one 0 f (x) = 1 at each point x e K\ hence 

(3) shows that (1) holds. //// 

The Riesz Representation Theorem 

2.14 Theorem Let X be a locally compact Hausdorff space , and let A be a 
positive linear functional on C c (X). Then there exists a a-algebra 501 in X which 
contains all Borel sets in X , and there exists a unique positive measure p on 501 
which represents A in the sense that 

(a) A f = j x f dn for every f 6 C c (X), 

and which has the following additional properties: 
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(b) p(K) < oo for every compact set K c= X. 

(c) For every E e 901, we have 

p(E) = inf {p(V): E c V, V open}. 

(d) The relation 

p(E) = sup {p(K ) : K c= £, K compact} 

holds for every open set £, and /or £ e 2ft with p(E) < oo. 

(e) If E e 2ft, A c= £, and /i(£) = 0, r/ien >4 e 2ft. 

For the sake of clarity, let us be more explicit about the meaning of the word 
“positive” in the hypothesis: A is assumed to be a linear functional on the 
complex vector space C c (X), with the additional property that A/is a nonnegative 
real number for every / whose range consists of nonnegative real numbers. 
Briefly, if f(X) c= [0, oo) then A/ e [0, oo). 

Property (a) is of course the one of greatest interest. After we define 2ft and p, 
(b) to (d) will be established in the course of proving that 2ft is a a-algebra and 
that p is countably additive. We shall see later (Theorem 2.18) that in 
“reasonable” spaces X every Borel measure which satisfies (b) also satisfies (c) 
and (d) and that (d) actually holds for every E e 2ft, in those cases. Property (e) 
merely says that ( X , 2R, p) is a complete measure space, in the sense of Theorem 
1.36. 

Throughout the proof of this theorem, the letter K will stand for a compact 
subset of X, and V will denote an open set in X. 

Let us begin by proving the uniqueness of p. If p satisfies (c) and (d), it is clear 
that p is determined on 2ft by its values on compact sets. Hence it suffices to 
prove that p^K) = p 2 (K) for all K , whenever p x and p 2 are measures for which 
the theorem holds. So, fix K and e > 0. By (b) and (c), there exists a V K with 
p 2 (. F) < Pi(K) + €; by Urysohn’s lemma, there exists an / so that K <f < V; 
hence 


Mi (*) 


Xk dn i < ) fdHi = Af= f dn 2 
JX Jx 


~ \x Xv dfl2 = < + C ' 


Thus p^K) < p 2 {K). If we interchange the roles of p x and p 2 , the opposite 
inequality is obtained, and the uniqueness of p is proved. 

Incidentally, the above computation shows that (a) forces ( b ). 


Construction of p and 2ft 

For every open set V in X , define 

fi(V) = sup {Af:f< V }. 


(1) 
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If V x <= V 2 , it is clear that (1) implies pfV^ < p(V 2 ). Hence 

p(E) = inf {p(V): E a V, V open}, (2) 

if E is an open set, and it is consistent with (1) to define p(E) by (2), for every 
E^X. 

Note that although we have defined p(E) for every E cz X, the countable 
additivity of p will be proved only on a certain a-algebra in X. 

Let $tt F be the class of all E c= X which satisfy two conditions: p(E) < oo, and 

p(E) = sup {p(K): K c= E, K compact}. (3) 

Finally, let be the class of all E c= X such that E n K e $tt F for every compact 
K. 

Proof that p and have the required properties 

It is evident that p is monotone , i.e., that p{A) < p(B) if A a B and that 
p(E) = 0 implies E e $fl F and E e 90L Thus (e) holds, and so does (c), by defini- 
tion. 

Since the proof of the other assertions is rather long, it will be convenient to 
divide it into several steps. 

Observe that the positivity of A implies that A is monotone: f < g implies 
A f < Ag. This is clear, since Ag = Af + A (g — f) and g — f > 0. This monot- 
onicity will be used in Steps II and X. 

step I If E l9 E 2 , E 39 ... are arbitrary subsets of X, then 

( OO \ 00 

U E j * .1 (4) 


Proof We first show that 

p(Vi u V 2 ) < MFi) + p(V 2 \ (5) 

if V i and V 2 are open. Choose g < V t u V 2 . By Theorem 2.13 there are func- 
tions h x and h 2 such that h t -< V i and h x (x) 4- h 2 (x) = 1 for all x in the 
support of g. Hence h t g < V i9 g = h x g 4- h 2 g, and so 

Ag = A (h lG ) 4- A (h 2 g) < p(V,) 4- p(V 2 ). (6) 

Since (6) holds for every g <V X u V 2 , (5) follows. 

If piEi) = oo for some i, then (4) is trivially true. Suppose therefore that 
< oo for every i. Choose e > 0. By (2) there are open sets V t => E t such 

that 


MK) < + 2-*£ (i=l,2, 3,...). 
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Put V = [J™ V i9 and choose f <V. Since / has compact support, we see that 
/-< F x u • ♦ • u V n for some n. Applying induction to (5), we therefore obtain 

A/<M^1 U ••• u V n ) < AVJ + ■■■ + KV n ) < tli(Ei) + e. 

i = 1 

Since this holds for every f <V, and since (J E { a F, it follows that 

0 E ij ^ t4Y) ^ + e, 

which proves (4), since e was arbitrary. //// 

step II If K is compact , then K e $ft F and 

fi(K) = inf {A/: K <f). (7) 

This implies assertion ( b ) of the theorem. 

Proof If K and 0 < a < 1, let V a = {x: f(x ) > a}. Then K a V a , and 
a g <f whenever g -< V a . Hence 

fi(K) < fi(VJ = sup {Agr: g<V x } < a" ‘A/. 

Let a-> 1, to conclude that 

P(K) < A/. (8) 

Thus p(K) < oo. Since K evidently satisfies (3), K e 2ft F . 

If e > 0, there exists F => K with p(V) < p(K) + e. By Urysohn’s lemma, 
K <f < V for some /. Thus 

A g(V) < p(K) + 6, 

which, combined with (8), gives (7). I III 

step hi Every open set satisfies (3). Hence $W F contains every open set V with 
p(V) < oo. 

Proof Let a be a real number such that a < p(V). There exists an f <V with 
a < A/. If W is any open set which contains the support K of f then f 
hence A f < p(W). Thus A f < p(K). This exhibits a compact K a V with 
a < p(K ), so that (3) holds for F. //// 

step iv Suppose E = E t , where E u E 2 , E z , ... are pairwise disjoint members 
of m F . Then 

H(E)= Z KEd- 

i= 1 

If in addition , p(E) < oo, then also E e $W F . 


(9) 
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Proof We first show that 

KKi u K 2 ) = KKJ + ii(K 2 ) (10) 

if K t and K 2 are disjoint compact sets. Choose e > 0. By Urysohn’s lemma, 
there exists fe C c (X) such that f(x) = 1 on K u f(x) = 0 on K 2i and 
0 </< 1. By Step II there exists g such that 

u K 2 -< # and A# < u X 2 ) 4- e. 

Note that K l <fg and K 2 <(1 — f)g. Since A is linear, it follows from (8) 
that 


KKi) + fi(K 2 ) < A(/gf) + A (g - fg ) = A g< fj(. K t u K 2 ) + e. 

Since e was arbitrary, (10) follows now from Step I. 

If n(E) = oo, (9) follows from Step I. Assume therefore that g(E) < oo, 
and choose e > 0. Since E t e $Jl F , there are compact sets H t a E t with 

KHd > n(E t ) -2~‘e (i= 1,2,3,...). (11) 

Putting K„ = H l u • •• u H n and using induction on (10), we obtain 

H(E) > n(K n ) = t n(H>) > £ KE t ) - e. (12) 

i = 1 i= 1 

Since (12) holds for every n and every e > 0, the left side of (9) is not smaller 
than the right side, and so (9) follows from Step I. 

But if g(E) < oo and e > 0, (9) shows that 

H(E) < £ niEi) + € (13) 

i = 1 

for some N. By (12), it follows that g(E) < g(K N ) + 2e, and this shows that E 
satisfies (3); hence E e //// 

step v If E eWlp and e > 0, there is a compact K and an open V such that 
KczEczV andp(V - K) < 6. 

Proof Our definitions show that there exist K a E and V => E so that 
H{V) - e -< H(E) < + \- 

Since V — K is open, V — K e , by Step III. Hence Step IV implies that 

H(K) + n(V — K) = f4V) < n(K) + e. //// 

step vi If A e 9ffl F and B e 9ffl F , then A — £, A u B, and A n B belong to 9ffl F . 
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proof If 6 > 0, Step V shows that there are sets X, and V t such that 
X x ^ A <= V l9 X 2 c B c= V 2 , and p{V t - X,) < €, for i = 1, 2. Since 

A _ B c= V t - X 2 cz (V, - K t ) u (K t - V 2 ) u (V 2 - X 2 ), 

Step I shows that 

p(A - B) < 6 + p(K 1 - V 2 ) + 6. (14) 

Since K x — V 2 is a compact subset of A — B, (14) shows that A — B satisfies 
(3), so that A — B e 90^ . 

Since A v B = (A — B) v B, an application of Step IV shows that 
A u B e 50l F . Since A n B = A — (A — B\ we also have A n B e 50l F . //// 

step vii 901 is a a-algebra in X which contains all Borel sets. 

proof Let K be an arbitrary compact set in X. 

If A e 901, then A c n K = K — (A n K\ so that A c n K is a difference of 
two members of Jl F . Hence A c n K e 9 0? F , and we conclude: A e 901 implies 
A c e m. 

Next, suppose A = (J J° A i9 where each A t e 901 Put B x = A t n X, and 

B n = (A n n K) - (B x u • • • u B n _ t ) (n = 2, 3, 4, .. .). (15) 

Then { B n } is a disjoint sequence of members of 90i F , by Step VI, and 
A n K = (J B n . It follows from Step IV that A n K e 90l F . Hence A e 901 
Finally, if C is closed, then C n K is compact, hence C n K e 90l F , so 
C e 901. In particular, X e 901. 

We have thus proved that 901 is a a-algebra in X which contains all 
closed subsets of X. Hence 901 contains all Borel sets in X. //// 

step vm 90 ? f consists of precisely those sets E e 901 for which /i(E) < oo. 

This implies assertion ( d ) of the theorem. 

Proof If E e 90? F , Steps II and VI imply that E n K e 90? F for every 
compact X, hence E e 901. 

Conversely, suppose E e 901 and p(E) < oo, and choose e > 0. There is an 
open set V => E with ju(V) < oo; by III and V, there is a compact X V with 
p(V — X) < 6. Since E n X e 90? F , there is a compact set H a E n X with 

p(E n X) < p(H) + 6. 

Since E a (E n X) u (V — X), it follows that 

ti{E) < p(E n X) + p(V - X) < /i(H) + 2e, 
which implies that E e 90? F . //// 


step ix p is a measure on 901. 
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Proof The countable additivity of on 9W follows immediately from Steps 
IV and VIII. I III 


step x For every f e C c (X\ A f = j x f dp. 


This proves (a), and completes the theorem. 

Proof Clearly, it is enough to prove this for real /. Also, it is enough to 
prove the inequality 


A/ < f / dfi (16) 

for every real / e C c (X). For once (16) is established, the linearity of A shows 
that 


-A f= A (-/) < f (-/) dp = - f fd* 

Jx Jx 

which, together with (16), shows that equality holds in (16). 

Let K be the support of a real / e C c (X), let [a, b] be an interval which 
contains the range of / (note the Corollary to Theorem 2.10), choose € > 0, 
and choose y { , for i = 0, 1, . . . , n, so that y t — i < € and 

y 0 <a<y l <--- <y„ = b. (17) 

Put 

Ei = {x: <f(x) < y,} n K (f = 1, .... n). (18) 

Since / is continuous, / is Borel measurable, and the sets E t are therefore 

disjoint Borel sets whose union is K. There are open sets V t -=> E t such that 

KK) < ME;) + ~ n (i = 1, . . . , n) (19) 

and such that f(x) < y { + e for all x e V { . By Theorem 2.13, there are func- 

tions h t -< V { such that £ h t = 1 on K. Hence / = £ h t f 9 and Step II shows 
that 


ME) < A£ h,) = X A h t . 
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Since h t f <(y { + e)hi , and since y t — e <f (x) on £, , we have 


A/= (hj)< E(^ + €)A^. 

i = 1 i = 1 

= E ( I a | + ^ + e)A/ji - | a | £ A/i| 

i=l i= 1 

< E ( I a I + y,- + «) W E,) + e/ri] - \ a \ n(K) 

i — 1 

= i (y, - €)/.(£,) + 2 €/i(K) + 1 E(M + y< + €) 

i= 1 W £=1 

< J /d/i + €[2/i(K) + | a I + b + €]. 

Since e was arbitrary, (16) is established, and the proof of the theorem is 
complete. //// 


Regularity Properties of Borel Measures 

2.15 Definition A measure defined on the a-algebra of all Borel sets in a 
locally compact Hausdorff space X is called a Borel measure on X. If [i is 
positive, a Borel set E c= X is outer regular or inner regular , respectively, if E 
has property (c) or ( d) of Theorem 2.14. If every Borel set in X is both outer 
and inner regular, n is called regular. 

In our proof of the Riesz theorem, outer regularity of every set E was 
built into the construction, but inner regularity was proved only for the open 
sets and for those E e 901 for which g(E) < oo. It turns out that this flaw is in 
the nature of things. One cannot prove regularity of n under the hypothesis 
of Theorem 2.14; an example is described in Exercise 17. 

However, a slight strengthening of the hypotheses does give us a regular 
measure. Theorem 2.17 shows this. And if we specialize a little more, 
Theorem 2.18 shows that all regularity problems neatly disappear. 


2.16 Definition A set £ in a topological space is called o-compact if £ is a 
countable union of compact sets. 

A set £ in a measure space (with measure p) is said to have a-finite 
measure if £ is a countable union of sets E x with /*(£*) < oo. 

For example, in the situation described in Theorem 2.14, every <7- 
compact set has a-finite measure. Also, it is easy to see that if £ e 901 and £ 
has a-finite measure, then £ is inner regular. 
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2.17 Theorem Suppose X is a locally compact , o-compact Hausdorff space. If 
and p are as described in the statement of Theorem 2.14 , then and p have 
the following properties: 

(a) If E eWI and e > 0, there is a closed set F and an open set V such that 
F cz E cz V and p(V — F) < e. 

(b) p is a regular Borel measure on X. 

(c) If E e there are sets A and B such that A is an F a9 B is a G d , 
A cz E cz B 9 and p(B — A) = 0. 


As a corollary of (c) we see that every E e is the union of an F a and a set 
of measure 0. 

Proof Let X = K x u K 2 u K 3 u • • • , where each K n is compact. If E eWl 
and 6 > 0, then p(K n n E) < oo, and there are open sets V n id K n n E such 
that 

n(K - (K. n £)) < ^ (n = 1, 2, 3, . . .)• (1) 

If V = (J V n , then V - E <= (J (F„ - (K„ n E)), so that 

H(V-E)< € -. 

Apply this to E c in place of E : There is an open set W zz> E c such that 
p(W - E c ) < e/2. If F = W c 9 then F c £, and E - F = W - E c . Now (a) 
follows. 

Every closed set F a X is a-compact, because F = (J (F n K n ). Hence 

(a) implies that every set E e is inner regular. This proves (b). 

If we apply (a) with e = 1 jj (j = 1, 2, 3, . . .), we obtain closed sets Fj and 
open sets Vj such that F j a E a Vj and p(Vj — F < l/j. Put A = [j Fj and 
B = P| Vj, Then A a E cz B, A is an F a , B is a G s , and p(B — A) = 0 since 

B — A cz Vj — Fj for j = 1, 2, 3, ... . This proves (c). //// 


2.18 Theorem Let X be a locally compact Hausdorff space in which every open 
set is o-compact. Let X be any positive Borel measure on X such that X(K) < oo 
for every compact set K. Then X is regular. 


Note that every euclidean space R k satisfies the present hypothesis, since 
every open set in R k is a countable union of closed balls. 
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Proof Put A f = j x f ^2, for / e C c (2Q. Since X(K) < oo for every compact K , 
A is a positive linear functional on C c (X ), and there is a regular measure p, 
satisfying the conclusions of Theorem 2.17, such that 



(/e C c (X)). 


( 1 ) 


We will show that 2 = p. 

Let F be open in X. Then F = (J where is compact, 
i = 1, 2, 3, .... By Urysohn’s lemma we can choose f t so that K t < F. Let 
g n = max (f i9 ...,/„). Then e C c (AT) and g n (x) increases to Xv( x ) at every 
point x e X. Hence (1) and the monotone convergence theorem imply 

MY) = Km fg n dX= lim f g„ d\i = p(F) . (2) 

n-» oo Jx n-»oo Jx 

Now let £ be a Borel set in X, and choose e > 0. Since p satisfies Theo- 
rem 2.17, there is a closed set F and an open set F such that F a E <= F and 
p(F — F) < e. Hence p(F) < p(F) + e < p(£) + 6. 

Since F — F is open, (2) shows that 2(F — F) < 6, hence 2(F) < 2(£) + 6. 

Consequently 


2(£) < 2(F) = p(F) < p(£) + € 
and ii{E) < p(F) = 2(F) < 2(£) + 6 , 

so that | 2(£) — g(E) | < e for every e > 0. Hence 2(E) = p(£). //// 

In Exercise 18 a compact Hausdorff space is described in which the com- 
plement of a certain point fails to be cr-compact and in which the conclusion of 
the preceding theorem is not true. 

Lebesgue Measure 

2.19 Euclidean Spaces Euclidean fc-dimensional space R k is the set of all points 
x = (^j, ..., £ k ) whose coordinates are real numbers, with the following alge- 
braic and topological structure: 

If x = (^j, . . . , £ k ), y = (rj u . . . , rj k ), and a is a real number, x + y and ax are 
defined by 

x + y = {£ l + th,...,Z k + ri k ), otx = («<*!, ...,<xi k ). (1) 

This makes R k into a real vector space. If x • y = £ £,>/,• and | x | = (x • x) 1/2 , the 
Schwarz inequality |xy|<|x||y| leads to the triangle inequality 

\x-y\<\x-z\ + \z-y\; (2) 

hence we obtain a metric by setting p(x, y) = | x — y | . We assume that these facts 
are familiar and shall prove them in greater generality in Chap. 4. 
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If E a R k and x g R k , the translate ofEbyx is the set 

E + x = {y + x: y g E). (3) 

A set of the form 

W = {x: a f < £i < Pi, 1 < i < k}, (4) 

or any set obtained by replacing any or all of the < signs in (4) by < , is called a 
k-cell; its volume is defined to be 

vol (HO = ft (ft - aft (5) 

i= 1 

If a g R k and 3 > 0, we shall call the set 


Q(a\ 3) = {x: (x t < + 3, 1 < i < k} 


( 6 ) 


the 3-box with corner at a. Here a = (a x , . . . , cc k ). 

For n = 1, 2, 3, we let P n be the set of all x g R k whose coordinates are 
integral multiples of 2~ n , and we let Q n be the collection of all 2~ n boxes with 
corners at points of P n . We shall need the following four properties of {Q„}. The 
first three are obvious by inspection. 

(a) If n is fixed, each x e R k lies in one and only one member of Q„ . 

(b) If Q' e Q n , Q” g O r , and r < n, then either Q a Q" or Q ' n Q " = 0. 

(c) If Q g Q r , then vol (Q) = 2~ rk ; and if n > r, the set P n has exactly 2 in ~ r)k 
points in Q. 

(d) Every nonempty open set in R k is a countable union of disjoint boxes belonging 
to u Q 2 u fi 3 u • • • . 


Proof of (d) If V is open, every x g V lies in an open ball which lies in V ; 
hence x e Qa V for some Q belonging to some Q„ . In other words, V is the 
union of all boxes which lie in V and which belong to some From this 
collection of boxes, select those which belong to and remove those in Q 2 , 
Q 3 , . . . which lie in any of the selected boxes. From the remaining collection, 
select those boxes of Q 2 which lie in V , and remove those in Q 3 , Q 4 , ... 
which lie in any of the selected boxes. If we proceed in this way, (a) and ( b ) 
show that (d) holds. //// 

2.20 Theorem There exists a positive complete measure m defined on a a- 
algebra in R k , with the following properties: 

(a) m(W) = vol (W) for every k-cell W. 

(b) contains all Borel sets in R k ; more precisely, E gWI if and only if there 
are sets A and B a R k such that A a E a B, A is an F a , B is a G d , and 
m(B — A) = 0. Also, m is regular. 
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(c) m is translation-invariant , i.c., 

m(E + x) = m(E) 

for every £ e 901 and every x e R k . 

(d) If p is any positive translation-invariant Borel measure on R k such that 
KK) < oo for every compact set K 9 then there is a constant c such that 
p(E) = cm(E) for all Borel sets E a R k . 

(e) T o every linear transformation T of R k into R k corresponds a real number 
A (T) such that 


m(T(E)) = A(T)m(E) 

for every £ e 901. /n particular , m(T(E)) = m(E) when T is a rotation. 


The members of 901 are the Lebesgue measurable sets in R k ; m is the Lebesgue 
measure on R k . When clarity requires it, we shall write m k in place of m. 


Proof If / is any complex function on R k , with compact support, define 

A„ /= 2-* £ f(x) (n= 1,2,3,...), (1) 

xePn 

where P n is as in Sec. 2.19. 

Now suppose / e C c (R%f is real, W is an open /c-cell which contains the 
support of f and e > 0. The uniform continuity of /([ 26], Theorem 4.19) 
shows that there is an integer N and that there are functions g and h with 
support in W 9 such that (i) g and h are constant on each box belonging to 
CV (ii) g <f <h 9 and (iii) h — g < e. If n > N, Property 2.19(c) shows that 


A*0 = A„g<, A„/< A„h = A N h. (2) 

Thus the upper and lower limits of {A„/} differ by at most e vol (W), and 
since e was arbitrary, we have proved the existence of 

A/= lim A„/ (f e C c (R k )). (3) 

n~* oo 

It is immediate that A is a positive linear functional on C c (R k ). (In fact, A f 
is precisely the Riemann integral of / over R k . We went through the preceding 
construction in order not to have to rely on any theorems about Riemann 
integrals in several variables.) 

We define m and 901 to be the measure and o-algebra associated with this A 
as in Theorem 2.14. 

Since Theorem 2.14 gives us a complete measure and since R k is a- 
compact, Theorem 2.17 implies assertion (b) of Theorem 2.20. 
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To prove (a), let W be the open cell 2.19(4), let E r be the union of those 
boxes belonging to Q r whose closures lie in W, choose f r so that E r < 
f r < W, and put g r = max {f l9 . . . ,/ r }. Our construction of A shows that 

vol ( E r ) < A f r < A g r < vol W. (4) 

As r— > oo, vol (E r )—> vol (W) 9 and 

A g r = j*0 r dm-> m(W) (5) 

by the monotone convergence theorem, since g£x)-+ Xw( x ) for all x e R k . 
Thus m(W) = vol (W) for every open cell W 9 and since every k-cell is the 
intersection of a decreasing sequence of open /c-cells, we obtain (a). 

The proofs of (c), (d), and ( e ) will use the following observation: If X is a 
positive Borel measure on R k and X(E) = m(E) for all boxes E , then the same 
equality holds for all open sets E , by property 2.19(d), and therefore for all 
Borel sets E 9 since X and m are regular (Theorem 2.18). 

To prove (c), fix x e R k and define X(E) = m(E + x). It is clear that X is 
then a measure; by (a), X(E) = m(E) for all boxes, hence m(E + x) = m(E) for 
all Borel sets E. The same equality holds for every E e 2R, because of ( b ). 

Suppose next that satisfies the hypotheses of (d). Let Q 0 be a 1-box, put 
c — g(Q 0 ). Since Q 0 is the union of 2 nk disjoint 2~ n boxes that are translates 
of each other, we have 

2 nk m = KQo) = cm(Q 0 ) = c • 2 nk m(Q) 

for every 2 ""-box Q. Property 2.19(d) implies now that g(E) = cm(E) for all 
open sets E c= R k . This proves (d). 

To prove (e) 9 let T : R k ^> R k be linear. If the range of T is a subspace Y 
of lower dimension, then m(Y) = 0 and the desired conclusion holds with 
A(T) = 0. In the other case, elementary linear algebra tells us that T is a 
one-to-one map of R k onto R k whose inverse is also linear. Thus T is a 
homeomorphism of R k onto R k 9 so that T(E) is a Borel set for every Borel set 
E 9 and we can therefore define a positive Borel measure \i on R k by 

KE) = m(T(E)). 

The linearity of T, combined with the translation-invariance of m, gives 
g(E + x) = m(T(E + x)) = m(T(E) + Tx ) = m(T(E)) = g(E). 

Thus \i is translation-invariant, and the first assertion of ( e ) follows from 
(d), first for Borel sets E 9 then for all E e 901 by ( b ). 

To find A(T), we merely need to know m(T(E))/m(E) for one set E with 
0 < m(E) < oo. If T is a rotation, let E be the unit ball of R k ; then T(E) = E 9 
and A (T) = 1. //// 
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2.21 Remarks If m is the Lebesgue measure on R k , it is customary to write l}(R k ) 
in place of l}(m). If E is a Lebesgue measurable subset of R k , and if m is restricted 
to the measurable subsets of E , a new measure space is obtained in an obvious 
fashion. The phrase “/ e L 1 on £” or “/ e !/(£)” is used to indicate that / is 
integrable on this measure space. 

If k = 1, if / is any of the sets (a, b), (a, ft], [a, b), [a, ft], and if / e Lf(/), it is 
customary to write 


J / (x) dx in place of J / dm. 

Since the Lebesgue measure of any single point is 0, it makes no difference over 
which of these four sets the integral is extended. 

Everything learned about integration in elementary Calculus courses is still 
useful in the present context, for if f is a continuous complex function on [a, fe], 
then the Riemann integral of f and the Lebesgue integral of f over [a, b~\ coincide. 
This is obvious from our construction if /(a) = f(b) = 0 and if /(x) is defined to be 
0 for x < a and for x > b. The general case follows without difficulty. Actually 
the same thing is true for every Riemann integrable / on [a, b~\. Since we shall 
have no occasion to discuss Riemann integrable functions in the sequel, we omit 
the proof and refer to Theorem 1 1.33 of [26]. 

Two natural questions may have occurred to some readers by now: Is every 
Lebesgue measurable set a Borel set? Is every subset of R k Lebesgue measurable? 
The answer is negative in both cases, even when k = 1. 

The first question can be settled by a cardinality argument which we sketch 
briefly. Let c be the cardinality of the continuum (the real line or, equivalently, 
the collection of all sets of integers). We know that R k has a countable base (open 
balls with rational radii and with centers in some countable dense subset of R% 
and that @ k (the collection of all Borel sets of R k ) is the a-algebra generated by 
this base. It follows from this (we omit the proof) that has cardinality c. On 
the other hand, there exist Cantor sets E a R l with m(E) = 0. (Exercise 5.) The 
completeness of m implies that each of the 2 C subsets of E is Lebesgue measur- 
able. Since 2 C > c , most subsets of E are not Borel sets. 

The following theorem answers the second question. 


2.22 Theorem If A a R 1 and every subset of A is Lebesgue measurable then 
m(A) = 0. 

Corollary Every set of positive measure has nonmeasurable subsets. 

Proof We shall use the fact that R 1 is a group, relative to addition. Let Q be 
the subgroup that consists of the rational numbers, and let £ be a set that 
contains exactly one point from each coset of Q in R 1 . (The assertion that 



54 REAL AND COMPLEX ANALYSIS 


there is such a set is a direct application of the axiom of choice.) Then E has 
the following two properties. 

(a) (E + r) n (E + s) = 0 if r e Q, s e Q,r ^ s. 

(b) Every x e R 1 lies in E + r for some r e Q. 

To prove (a), suppose x e (E + r) n (E 4- s). Then x = y + r = z + s for 

some y e E, z e E, y ^ z. But y — z = s — re Q, so that y and z lie in the 

same coset of Q, a contradiction. 

To prove ( b ), let y be the point of E that lies in the same coset as x, put 
r = x — y. 

Fix t e Q , for the moment, and put A t = A n (E 4 - 1)- By hypothesis, A t 
is measurable. Let K c= A, be compact, let H be the union of the translates 
K + r, where r ranges over Q n [0, 1]. Then H is bounded, hence m(H) < oo. 
Since K c= E + t, (a) shows that the sets K + r are pairwise disjoint. Thus 
m(H) = £ r m(K + r). But m(K + r) = m(K). It follows that m(K) = 0. This 
holds for every compact K a A t . Hence m(A t ) = 0. 

Finally, (b) shows that A = (J A t , where t ranges over Q. Since Q is 
countable, we conclude that m(A) — 0. //// 

2.23 Determinants The scale factors A (T) that occur in Theorem 2.20(e) can be 
interpreted algebraically by means of determinants. 

Let {e u . .., e k } be the standard basis for R k : the ith coordinate of ^ is 1 if 
i = j , 0 if i ^ 7 . If T" : .R* — > is linear and 

Te i=Yj* ij e i (l<j<k) ( 1 ) 

i= 1 

then det T is, by definition, the determinant of the matrix [T] that has in row 
i and column j. 

We claim that 


A(T) = |det T | . (2) 

If T = T X T 2 , it is clear that A(T) = A(T 1 )A(T 2 ). The multiplication theorem 
for determinants shows therefore that if (2) holds for and T 2 , then (2) also 
holds for T. Since every linear operator on R k is a product of finitely many linear 
operators of the following three types, it suffices to establish ( 2 ) for each of these: 

(I) {Te u . . . , Te k } is a permutation of {e u . . . , e k }. 

(II) Te^ = <xe u Te ( = e t for i = 2, . . . , k. 

(Ill) T^ = + e 2 , for i = 2, . . . , k. 

Let Q t> e the cube consisting of all x = (£ l5 ..., £ k ) with 0 < ^ < 1 for 
i 1 , . . . , k. 

If T is of type (I), then [T] has exactly one 1 in each row and each column 
and has 0 in all other places. So detT=±l. Also, T(Q) = Q. So 
A(T) =1 = | det T | . 
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If T is of type (II), then clearly A(T) = | a | = | det T | . 

If T is of type (III), then det T = 1 and T(Q) is the set of all points £ 
whose coordinates satisfy 


€i£€ 2 <€i + 1, 0 < ^- < 1 if i # 2. (3) 

If Si is the set of points in T(Q) that have £ 2 < 1 and if S 2 is the rest of T(Q), then 

Si u (S 2 - e 2 ) = G, (4) 

and S t n (S 2 — e 2 ) is empty. Hence A(T) = m(S t u S 2 ) = m^) + m(S 2 — e 2 ) = 
w(G) = 1, so that we again have A(T) = | det T | . 

Continuity Properties of Measurable Functions 

Since the continuous functions played such a prominent role in our construction 
of Borel measures, and of Lebesgue measure in particular, it seems reasonable to 
expect that there are some interesting relations between continuous functions and 
measurable functions. In this section we shall give two theorems of this kind. 

We shall assume, in both of them, that \i is a measure on a locally compact 
Hausdorff space X which has the properties stated in Theorem 2.14. In particular, 
p could be Lebesgue measure on some R k . 

2.24 Lusin’s Theorem Suppose f is a complex measurable function on X , 
H(A) < ooj(x) = 0 if x $ A, and e > 0. Then there exists age C c (X) such that 

K{x-f(x) # gf(x)}) < €. (1) 

Furthermore , we may arrange it so that 

sup |gf(x)| < sup |/(x)|. ( 2 ) 

xeX xeX 


Proof Assume first that 0 </ < 1 and that A is compact. Attach a sequence 
{s„} to f as in the proof of Theorem 1.17, and put t i = Si and t n = s n — s n . i 
for n = 2, 3, 4, Then 2 n t n is the characteristic function of a set T n c = A, and 

fix) = £ t„(x) (X 6 X). (3) 

n - 1 

Fix an open set V such that A c V and V is compact. There are compact 
sets K n and open sets V n such that K n cz T n cz V n a V and p(V n — K n ) <2~ n e. 
By Urysohn’s lemma, there are functions h n such that K n -< h n -< V n . Define 

0 (x)= £2 -”h n (x) (X 6 X). (4) 

n = 1 

This series converges uniformly on X , so g is continuous. Also, the support of 
g lies in V. Since 2 ~ n h n (x) = t n (x) except in V n — K n , we have g(x) =f(x) 
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except in (J (V n — K n ), and this latter set has measure less than e. Thus (1) 
holds if A is compact and 0 </< 1. 

It follows that (1) holds if A is compact and / is a bounded measurable 
function. The compactness of A is easily removed, for if p(A) < oo then A 
contains a compact set K with p(A — K) smaller than any preassigned posi- 
tive number. Next, if / is a complex measurable function and if B n = 
{x: \f(x) \ > n}, then f) B n = 0, so /*(£„)—► 0, by Theorem 1.19(e). Since / 
coincides with the bounded function (1 — Xb „ ) '/ except on B ni (1) follows in 
the general case. 

Finally, let R = sup { \f(x)\ : x e X }, and define cp(z ) = z if \z \ < R, 

cp(z) = Rz/ \z\ if | z | > R. Then (p is a continuous mapping of the complex 

plane onto the disc of radius R. If g satisfies (1) and g x = cp o g 9 then g x 

satisfies (1) and (2). //// 

Corollary Assume that the hypotheses of Lusiris theorem are satisfied and that 
| / 1 < 1. Then there is a sequence {g n } such that g n e C c (X\ \ g n | < 1, and 

f(x) = lim g n (x) a.e. (5) 


Proof The theorem implies that to each n there corresponds a g n e C c (X ), 
with \g n \ < 1, such that p(E n ) < 2"", where E n is the set of all x at which 
/(x) j=- g n (x ). For almost every x it is then true that x lies in at most finitely 
many of the sets E n (Theorem 1.41). For any such x, it follows that /(x) = 
g n (x) for all large enough n. This gives (5). //// 

2.25 The Vitali-Caratheodory Theorem Suppose f e L x (/i),/ is real-valued , and 
e > 0. Then there exist functions u and v on X such that u <f < v, u is upper 
semicontinuous and bounded above , v is lower semicontinuous and bounded 
below , and 


Ja 


(v — u) dp < 6. 


(1) 


Proof Assume first that / > 0 and that / is not identically 0. Since / is the 
pointwise limit of an increasing sequence of simple functions s n , / is the sum 
of the simple functions t n — s n — s n _ x (taking s 0 = 0), and since t n is a linear 
combination of characteristic functions, we see that there are measurable sets 
£, (not necessarily disjoint) and constants c t > 0 such that 


00 


/(*)= Z c.-Xe.M 

i= 1 


(x e X). 


(2) 
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Since 


I 


fdn = £c, n(E t ), 


(3) 


the series in (3) converges. There are compact sets K t and open sets V t such 
that K, c= E: c= V: and 


CiH(Vi-K)< 2- i ~ 1 e (<= 1,2,3,...). 


Put 


V=Z C iXK,> U=Y,CiXK„ 

i — 1 i — 1 


where JV is chosen so that 


Z c i < v 

JV-t- i ^ 


(4) 

(5) 

(6) 


Then v is lower semicontinuous, u is upper semicontinuous, u <f < v 9 and 

AT oo 

v ~ u = Z c i(XK,- - Z*i) + Z c i Xr, 


— Z C i(Xv i - Xk.) + Z c i Xe,- 


N+ 1 


so that (4) and (6) imply (1). 

In the general case, write / = / + — /“, attach u t and to / + , attach w 2 
and v 2 to /“, as above, and put w = — i; 2 , v = — u 2 . Since — u 2 is upper 

semicontinuous and since the sum of two upper semicontinuous functions is 
upper semicontinuous (similarly for lower semicontinuous; we leave the 
proof of this as an exercise), u and v have the desired properties. //// 

Exercises 

1 Let {/„} be a sequence of real nonnegative functions on R l , and consider the following four state- 
ments : 

(a) If f x and f 2 are upper semicontinuous, then f x + f 2 is upper semicontinuous. 

(b) If /j and f 2 are lower semicontinuous, then f x -I- f 2 is lower semicontinuous. 

(c) If each /„ is upper semicontinuous, then f n is upper semicontinuous. 

(d) If each /„ is lower semicontinuous, then /„ is lower semicontinuous. 

Show that three of these are true and that one is false. What happens if the word “ nonnegative ” 
is omitted? Is the truth of the statements affected HR 1 is replaced by a general topological space? 

2 Let /be an arbitrary complex function on R l , and define 

<p(x, S) = sup { | f(s) - f(t ) | : s, t e {x - S, x + S )}, 

(p(x) = inf {(p(x, S): S > 0}. 

Prove that (p is upper semicontinuous, that / is continuous at a point x if and only if (p{x) = 0, and 
hence that the set of points of continuity of an arbitrary complex function is a G b . 

Formulate and prove an analogous statement for general topological spaces in place of R 1 . 
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3 Let X be a metric space, with metric p. For any nonempty E cz X, define 

p^x) = inf {p(x, y ): y e £}. 

Show that p E is a uniformly continuous function on X. If A and B are disjoint nonempty closed 
subsets of X, examine the relevance of the function 


/(*) = 


Pa(*) 

Pa( x ) + PaM 


to'Urysohn’s lemma. 

4 Examine the proof of the Riesz theorem and prove the following two statements : 

(a) If E x cz V x and E 2 cz V 2 , where V x and V 2 are disjoint open sets, then p(E x u E 2 ) = 
p(E x ) + p{E 2 ), even if E x and E 2 are not in 9R. 

(b) If E e $R F , then E = N u K x u K 2 u • • • , where {K t } is a disjoint countable collection of 
compact sets and p(N) = 0. 

In Exercises 5 to 8, m stands for Lebesgue measure on R 1 . 

5 Let E be Cantor’s familiar “middle thirds” set. Show that m(E) = 0, even though E and R l have 
the same cardinality. 

6 Construct a totally disconnected compact set K c R l such that m{K) > 0. ( K is to have no con- 
nected subset consisting of more than one point.) 

If v is lower semicontinuous and v < Xk , show that actually v < 0. Hence Xk cannot be approx- 
imated from below by lower semicontinuous functions, in the sense of the Vitali-Caratheodory 
theorem. 

7 If 0 < e < 1, construct an open set E cz [0, 1] which is dense in [0, 1], such that m(E) = e. (To say 
that A is dense in B means that the closure of A contains B.) 

8 Construct a Borel set E cz R l such that 

0 < m(E n I) < m(7) 

for every nonempty segment /. Is it possible to have m{E) < oo for such a set? 

9 Construct a sequence of continuous function /„ on [0, 1] such that 0 < /„ < 1, such that 

lim f f n (x ) dx — 0, 

w -» oo JO 

but such that the sequence {/„(*)} converges for no x e [0, 1]. 

10 If {/„} is a sequence of continuous functions on [0, 1] such that 0 < /„ < 1 and such that /„(x) — > 0 
as n — ► oo, for every x e [0, 1], then 


lim f /„(x) dx = 0. 

n~* oo JO 

Try to prove this without using any measure theory or any theorems about Lebesgue integration. 
(This is to impress you with the power of the Lebesgue integral. A nice proof was given by W. F. 
Eberlein in Communications on Pure and Applied Mathematics , vol. X, pp. 357-360, 1957.) 

11 Let p be a regular Borel measure on a compact Hausdorff space X; assume p{X) = 1. Prove that 
there is a compact set K cz X (the carrier or support of p) such that p{K) - 1 but p{H) < 1 for every 
proper compact subset H of K. Hint : Let K be the intersection of all compact K a with p{K a ) = 1 ; 
show that every open set V which contains K also contains some K a . Regularity of p is needed; 
compare Exercise 18. Show that K c is the largest open set in X whose measure is 0. 

12 Show that every compact subset of R 1 is the support of a Borel measure. 
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13 Is it true that every compact subset of R l is the support of a continuous function? If not, can you 
describe the class of all compact sets in R l which are supports of continuous functions? Is your 
description valid in other topological spaces? 

14 Let / be a real- valued Lebesgue measurable function on R k . Prove that there exist Borel functions 
g and h such that g(x ) = h(x ) a.e. [m], and g{x) </(x) < h(x) for every x e R k . 

15 It is easy to guess the limits of 


JTH)' 


e x/2 dx and 



dx. 


as n— > oo. Prove that your guesses are correct. 

16 Why is m(Y) = 0 in the proof of Theorem 2.20(e)? 

17 Define the distance between points (x lf and ( x 2 , y 2 ) in the plane to be 


\y x -y 2 \ if x x = x 2 , l + l^i-^l 

Show that this is indeed a metric, and that the resulting metric space X is locally compact. 

If /e C c (X), let x l5 . . . , x n be those values of x for which /(x, y) # 0 for at least one y (there are 
only finitely many such x !), and define 


A/= £ f" f(xj, y) dy. 

j= 1 J- 00 

Let (j. be the measure associated with this A by Theorem 2.14. If E is the x-axis, show that n(E) = oo 
although f. i{K ) = 0 for every compact K c E. 

18 This exercise requires more set-theoretic skill than the preceding ones. Let X be a well-ordered 
uncountable set which has a last element co 1} such that every predecessor of cwj has at most countably 
many predecessors. (“Construction”: Take any well-ordered set which has elements with uncountably 
many predecessors, and let a> 1 be the first of these; a> 1 is called the first uncountable ordinal.) For 
a e X, let P a [SJ be the set of all predecessors (successors) of a, and call a subset of X open if it is a P a 
or an S fi or a P a n S fi or a union of such sets. Prove that X is then a compact Hausdorflf space. ( Hint : 
No well-ordered set contains an infinite decreasing sequence.) 

Prove that the complement of the point coj is an open set which is not a-compact. 

Prove that to every / e C(X) there corresponds an a # a> l such that /is constant on S a . 

Prove that the intersection of every countable collection {K„j of uncountable compact subsets of 
X is uncountable. ( Hint : Consider limits of increasing countable sequences in X which intersect each 
K n in infinitely many points.) 

Let 50t be the collection of all E cz X such that either E u {coj } or E c u {coj} contains an 
uncountable compact set ; in the first case, define X{E) = 1 ; in the second case, define 2(£) = 0. Prove 
that 501 is a a-algebra which contains all Borel sets in X, that X is a measure on 501 which is not regular 
(every neighborhood of a> l has measure 1), and that 


f(co i ) = ^fdX 

for every / e C{X). Describe the regular n which Theorem 2.14 associates with this linear functional. 

19 Go through the proof of Theorem 2.14, assuming X to be compact (or even compact metric) 
rather than just locally compact, and see what simplifications you can find. 

20 Find continuous functions /„: [0, 1]— ► [0, oo) such that /„(x)— ► 0 for all x e [0, 1] as n-> oo, 
Jo fn( x ) dx—* 0, but sup„ f H is not in L 1 . (This shows that the conclusion of the dominated convergence 
theorem may hold even when part of its hypothesis is violated.) 

21 If X is compact and /: X— ►( — oo, oo) is upper semicontinuous, prove that / attains its maximum 
at some point of X. 
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22 Suppose that X is a metric space, with metric d, and that /: X — ► [0, oo] is lower semicontinuous, 
f(p ) < oo for at least one p e X. For n = 1, 2, 3 , ...,x e X, define 

g n (x) = inf {/(p) + nd(x, p ): p e X} 

and prove that 

(i) \9n(x)~ g n {y)\ <nd(x,y\ 

(ii) 0 < q x < g 2 < ••• </, 

(iii) 0„(x)— ► /(x) as n— ► oo, for all xel 

Thus / is the pointwise limit of an increasing sequence of continuous functions. (Note that the con- 
verse is almost trivial.) 

23 Suppose V is open in R k and p is a finite positive Borel measure on R k . Is the function that sends x 
to p(V + x) necessarily continuous? lower semicontinuous? upper semicontinuous? 

24 A step function is, by definition, a finite linear combination of characteristic functions of bounded 
intervals in R l . Assume / e l}(R l ), and prove that there is a sequence {g n } of step functions so that 

lim j |/(x)-0„(x)| dx = 0. 

n-*a o J- oo 

25 (i) Find the smallest constant c such that 

log (1 + e l ) < c + t (0 < t < oo). 

(ii) Does 

lim - f log {1 + ^" /(x) } dx 

n-*oo n JO 

exist for every real /e L 1 ? If it exists, what is it? 



CHAPTER 
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Convex Functions and Inequalities 

Many of the most common inequalities in analysis have their origin in the notion 
of convexity. 

3.1 Definition A real function cp defined on a segment ( a , b\ where 
— oo < a < b < oo, is called convex if the inequality 

cp(( 1 — X)x + X y) < (1 — X)cp(x) 4- X cp(y) (1) 

holds whenever a < x < b, a < y < b, and 0 < X < 1. 

Graphically, the condition is that if x < t < y, then the point (f, (p(t )) 
should lie below or on the line connecting the points (x, cp(x)) and (y, cp(y)) in 
the plane. Also, (1) is equivalent to the requirement that 

<p(t) - <p(s) < <f>{u) - <f>(t) 

t—s~u—t ' 


whenever a < s <t <u < b. 


The mean value theorem for differentiation, combined with (2), shows imme- 
diately that a real differentiable function cp is convex in ( a , b) if and only if 
a < s < t < b implies cp'(s ) < (p'(t), i.e., if and only if the derivative cp' is a mono- 
tonically increasing function. 

For example, the exponential function is convex on (— oo, oo). 

3.2 Theorem If cp is convex on ( a , b) then (p is continuous on (a, b). 
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Proof The idea of the proof is most easily conveyed in geometric language. 
Those who may worry that this is not “ rigorous ” are invited to transcribe it 
in terms of epsilons and deltas. 

Suppose a < s < x < y < t <b. Write S for the point (s, <p(s)) in the 
plane, and deal similarly with x, y, and t. Then X is on or below the line SY, 
hence Y is on or above the line through S and X ; also, Y is on or below XT . 
As y— > x, it follows that Y — ► X, i.e., <p(y)— > <p(x ). Left-hand limits are handled 
in the same manner, and the continuity of (p follows. //// 

Note that this theorem depends on the fact that we are working on an open 
segment. For instance, if <p(x) = 0 on [0, 1) and cp( 1) = 1, then cp satisfies 3.1(1) on 
[0, 1] without being continuous. 

3.3 Theorem (Jensen’s Inequality) Let p be a positive measure on a o-algebra 
min a set Q, so that p(Q) = l. Iff is a real function in l}(p), if a </(x) <b for 
all xeQ, and if (p is convex on (a, b), then 


<®(j ' J dpj < ° f) dp. (1) 

Note : The cases a = — oo and b = o o are not excluded. It may happen that cp ° / 
is not in I}(p); in that case, as the proof will show, the integral of cp ° / exists in 
the extended sense described in Sec. 1.31, and its value is + oo. 


Proof Put t = / dp. Then a < t < b. If P is the supremum of the quotients 

on the left of 3.1(2), where a < s < t, then P is no larger than any of the 
quotients on the right of 3.1(2), for any u e (t, b). It follows that 

q>(s) > cp(t) + P(s — t) (a < s < b). (2) 

Hence 

cp(f(x)) - (p(t) - p(f(x) - t) > 0 (3) 

for every xefi. Since cp is continuous, (p ° / is measurable. If we integrate 
both sides of (3) with respect to p , (1) follows from our choice of t and the 
assumption p(Q) = 1. //// 

To give an example, take (p(x) = e x . Then (1) becomes 



If Q is a finite set, consisting of points p u . . . , p n , say, and if 


MiPi}) = l/n, f(Pi) = Xi, 
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(4) becomes 


exp <-(x t + 
In 


+ x„) f < - ( e xi + • • • 4- e Xn \ 


(5) 


for real x t . Putting y t = e Xi , we obtain the familiar inequality between the arith- 
metic and geometric means of n positive numbers : 


( y^2 ■ ■ ■ y») 1 /n z-(yi + y 2 + •■■ + yJ- 


( 6 ) 


Going back from this to (4), it should become clear why the left and right sides of 


exp 



(7) 


are often called the geometric and arithmetic means, respectively, of the positive 
function g. 

If we take p({pi}) = a* > 0, where £ a, = 1, then we obtain 


yVy? • • * yl n < y 2 + • • • + *„y n 


( 8 ) 


in place of (6). These are just a few samples of what is contained in Theorem 3.3. 
For a converse, see Exercise 20. 


3.4 Definition If p and q are positive real numbers such that p + q = pq, or 
equivalently 


then we call p and q a pair of conjugate exponents . It is clear that (1) implies 
1 < p < oo and 1 < q < oo. An important special case is p = q = 2. 

As p— > 1, (1) forces q— ► oo. Consequently 1 and oo are also regarded as a 
pair of conjugate exponents. Many analysts denote the exponent conjugate 
to p by p\ often without saying so explicitly. 


3.5 Theorem Let p and q be conjugate exponents , 1 < p < oo. Let X be a 
measure space , with measure p. Let f and g be measurable functions on X , with 
range in [0, oo]. Then 


and 




(i) 


{JW *}■%{[/', <,} w » 

The inequality (1) is Holder’s; (2) is Minkowski’s. If p = q = 2, (1) is known 
as the Schwarz inequality. 
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Proof Let A and B be the two factors on the right of (1). If A = 0, then / = 0 
a.e. (by Theorem 1.39); hence fg = 0 a.e., so (1) holds. If A > 0 and B = oo, (1) 
is again trivial. So we need consider only the case 0<,4<oo,0<£<oo. 
Put 


F 




(3) 


This gives 

I F p dp = I G q dp = 1. (4) 

jx jx 

If x e X is such that 0 < F(x) < oo and 0 < G(x) < oo, there are real 
numbers s and t such that F(x) = e slp , G(x) = e t,q . Since 1/p + 1 /q = 1, the 
convexity of the exponential function implies that 

e sl p + tlq < p -V + 4 "V. (5) 

It follows that 


F(x)G(x) < p~ 1 F(x) p + q-'Gix)* (6) 

for every x e X. Integration of (6) yields 

^FGdu^p- 1 +q~' = 1, (7) 

by (4); inserting (3) into (7), we obtain (1). 

Note that (6) could also have been obtained as a special case of the 


inequality 3.3(8). 

To prove (2), we write 

(/+ g) p =/•(/+ g) p ~ l + g • (/+ gY~ l - ( 8 ) 

Holder’s inequality gives 

jV (/+ g ) p - 1 < {| (/+ ( 9 ) 

Let (9') be the inequality (9) with / and g interchanged. Since (p — l)q = p, 
addition of (9) and (9') gives 


Clearly, it is enough to prove (2) in the case that the left side is greater 
than 0 and the right side is less than oo. The convexity of the function t p for 
0 < t < oo shows that 



< 5 (/ p + 0 p ). 
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Hence the left side of (2) is less than oo, and (2) follows from (10) if we divide 
by the first factor on the right of (10), bearing in mind that 1 — l/q = 1/p. 
This completes the proof. //// 

It is sometimes useful to know the conditions under which equality can hold 
in an inequality. In many cases this information may be obtained by examining 
the proof of the inequality. 

For instance, equality holds in (7) if and only if equality holds in (6) for 
almost every x. In (5), equality holds if and only if s = t. Hence “ F p = G q a.e.” is 
a necessary and sufficient condition for equality in (7), if (4) is assumed. In terms 
of the original functions / and p, the following result is then obtained: 

Assuming A < oo and B < oo, equality holds in (1) if and only if there are 
constants a and /?, not both 0 , such that of p = fig q a.e. 

We leave the analogous discussion of equality in (2) as an exercise. 

The Z?-spaces 

In this section, X will be an arbitrary measure space with a positive measure p. 

3.6 Definition If 0 < p < oo and if / is a complex measurable function on X , 
define 

,| /i | P = {|j/ |p ^} 1/P a) 

and let If(p) consist of all / for which 

i/ll, <00. (2) 

We call \\f\\ p the If-norm of /. 

If p is Lebesgue measure on R k , we write U(R k ) instead of L p (p), as in 
Sec. 2.21. If p is the counting measure on a set A, it is customary to denote 
the corresponding IF- space by f p (A), or simply by if A is countable. An 
element of may be regarded as a complex sequence x = {<!;„}, and 

r oo 'j i Ip 

IWI, = • 

3.7 Definition Suppose g: X— > [0, oo] is measurable. Let S be the set of all 
real a such that 

H{g~H{(x, oo])) = 0. (1) 

If 5 = 0, put P = oo. If S 0, put P = inf S. Since 

0 ])= 0 g~ l ^P + ^, oo 


( 2 ) 
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and since the union of a countable collection of sets of measure 0 has 
measure 0, we see that /? e S. We call /? the essential supremum of g. 

If/ is a complex measurable function on X , we define WfW^ to be the 
essential supremum of |/|, and we let L°°(p) consist of all / for which 
II /II oo < The members of L°°(p) are sometimes called essentially bounded 
measurable functions on X. 

It follows from this definition that the inequality \f(x)\ < X holds for almost all 
x if and only if X > ||/|| ^ . 

As in Definition 3.6, n°(R k ) denotes the class of all essentially bounded (with 
respect to Lebesgue measure) functions on R k , and /°°(,4) is the class of all 
bounded functions on A. (Here bounded means the same as essentially bounded, 
since every nonempty set has positive measure!) 

3.8 Theorem If p and q are conjugate exponents , 1 < p < oo, and if f e IF(p) 
and g e I3(fi), then fg e li(p\ and 

\\fgh<\\f\\ P \\g\\ q - (i) 

Proof For 1 < p < oo, (1) is simply Holder’s inequality, applied to |/| and 
M.ifp = oo, note that 


\f(x)g{x)\<\\f\\Jg{x)\ (2) 

for almost all x; integrating (2), we obtain (1). If p = 1, then q = oo, and the 
same argument applies. //// 

3.9 Theorem Suppose 1 < p < oo, and f e Zf(p), g e IF(fi). Then f + g e If(fi), 
and 


\\f+g\\ P <\\f\\ P +\\g\\ P - (i) 

Proof For 1 < p < oo, this follows from Minkowski’s inequality, since 

[\f+g\ p dp< [(\f\ + \g\ydiL 
Jx Jx 

For p — 1 or p — o o, (1) is a trivial consequence of the inequality 

l/+0l<l/l + M. //// 

3.10 Remarks Fix p, 1 < p < oo. If/e If(p) and a is a complex number, it is 
clear that of e IF(fi). In fact, 


ll a /llp = I a I ll/llp- 


( 1 ) 


In conjunction with Theorem 3.9, this shows that Zf(p) is a complex vector 
space. 
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Suppose /, g , and h are in H(p). Replacing fbyf—g and # by g — h in 
Theorem 3.9, we obtain 

\\f-h\\ p <\\f-g\\ p +\\g-h\\ p . (2) 

This suggests that a metric may be introduced in IF(p) by defining the dis- 
tance between / and g to be \\f— g\\ p . Call this distance d(f g) for the 
moment. Then 0 < d(f \ g) < oo, d(f \ f) = 0, d(f \ g) = d(g 9 /), and (2) shows 
that the triangle inequality d(f h) < d(f g) + d(g 9 h) is satisfied. The only 
other property which d should have to define a metric space is that 
d{f g) = 0 should imply that / = g. In our present situation this need not be 
so ; we have d(f g) = 0 precisely when f (x) = g(x) for almost all x. 

Let us write / ~ g if and only if d(f g) = 0. It is clear that this is an 
equivalence relation in LF(rf) which partitions U(p) into equivalence classes; 
each class consists of all functions which are equivalent to a given one. If F 
and G are two equivalence classes, choose / e F and g e G, and define 
d(F , G) = d(f g); note that / ~ / x and g ~ g^ implies 

d(f g) = d(f i9 g,\ 

so that d(F 9 G) is well defined. 

With this definition, the set of equivalence classes is now a metric space. 
Note that it is also a vector space, since / ~ / x and g ~ g t implies / + g ~ 
/x 4- g t and erf - crf v 

When LF(p) is regarded as a metric space, then the space which is really 
under consideration is therefore not a space whose elements are functions , but 
a space whose elements are equivalence classes of functions. For the sake of 
simplicity of language, it is, however, customary to relegate this distinction to 
the status of a tacit understanding and to continue to speak of U(p) as a 
space of functions. We shall follow this custom. 

If {/„} is a sequence in L p (p) 9 if/e L p (p) 9 and if lim^^ || f n -f\\ p = 0, we 
say that {/„} converges to f in lF(p) (or that {/„} converges to /in the mean of 
order p 9 or that {/„} is L p -convergent to /). If to every e > 0 there corre- 
sponds an integer N such that \\f n —f m \\ p <€ as soon as n > N and m > N 9 
we call {/„} a Cauchy sequence in These definitions are exactly as in any 
metric space. 

It is a very important fact that U{p) is a complete metric space, i.e., that 
every Cauchy sequence in U(p) converges to an element of lF(p)\ 

3.11 Theorem U(p) is a complete metric space, for 1 < p < oo and for every 
positive measure \i. 

Proof Assume first that 1 < p < oo. Let {/„} be a Cauchy sequence in U(p). 
There is a subsequence {/„.}, < n 2 < •• • , such that 

ll/ni+i —fm lip < (i= 1,2,3,...). 


( 1 ) 
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Put 


9k = £ I fn i+l -f m \, 9= f, l/„ j+1 -fm I- (2) 

i = 1 /= 1 

Since (1) holds, the Minkowski inequality shows that \\g k \\ p < 1 for k = 1, 

2, 3, Hence an application of Fatou’s lemma to {g£} gives \\g\\ p < 1. In 

particular, g(x) < oo a.e., so that the series 

/»,(*) + E (/ ni+1 M -/»,(*)) (3) 

1=1 

converges absolutely for almost every x e X. Denote the sum of (3) by /(x), 
for those x at which (3) converges; put f(x) = 0 on the remaining set of 
measure zero. Since 


fni + Z (/«i+i —/«,) — /w*> (4) 

i = i 

we see that 


/(*) = lim f n .(x) a.e. (5) 

i-+ oo 

Having found a function / which is the poimtwise limit a.e. of {f ni }, we 
now have to prove that this / is the LMimit of {/„}. Choose e > 0. There 
exists an N such that \\f n — f m \\ p < e \i n> N and m> N. For every m> N, 
Fatou’s lemma shows therefore that 

f \f-fm r dg < lim inf f | f ni ~f m \> dg < e'. (6) 

JX i - oo JX 

We conclude from (6) that / —f m e U(g), hence that / e U(g) [since / = 
(f-fm) +/m], and finally that \\f-f m ll p ->0 as m-> oo. This completes the 
proof for the case 1 < p < oo. 

In L?(g) the proof is much easier. Suppose {/„} is a Cauchy sequence in 
L°°(/4 let A k and B m n be the sets where \f k {x)\>\\f k \\ o0 and where 
I /„(*) -fm Ml > II fn -fm II oo , and let E be the union of these sets, for k , m, 

n = 1, 2, 3, Then g(E) — 0, and on the complement of E the sequence {/„} 

converges uniformly to a bounded function f Define /(x) = 0 for x e E. Then 
/eL 00 ^), and \\f n -/IL^ 0 as oo. //// 

The preceding proof contains a result which is interesting enough to be 
stated separately: 

3.12 Theorem If 1 < p < oo and {/„} is a Cauchy sequence in IS(p), with 
limit f then {/„} has a subsequence which converges pointwise almost every- 
where to f (x). 
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The simple functions play an interesting role in LF(p): 

3.13 Theorem Let S be the class of all complex , measurable , simple functions 
on X such that 


l*({x: s(x) 0}) < co. (1) 

VI < p < oo, then S is dense in LF(p). 

Proof First, it is clear that S c= U(p). Suppose /> 0,/e LF(p :), and let { s n } be 
as in Theorem 1.17. Since 0 < s n < f we have s n e LF(p :), hence s n e S. Since 
|/— s„\ p <f p , the dominated convergence theorem shows that 
HZ' — || p — ► 0 as n— ► oo. Thus / is in the Zf-closure of S. The general case 
(/ complex) follows from this. //// 

Approximation by Continuous Functions 

So far we have considered LF(p) on any measure space. Now let X be a locally 
compact Hausdorff space, and let \i be a measure on a a - algebra 9W in X , with 
the properties stated in Theorem 2.14. For example, X might be R k , and p might 
be Lebesgue measure on R k . 

Under these circumstances, we have the following analogue of Theorem 3.13: 

3.14 Theorem For 1 < p < oo, C c (X) is dense in LF(p). 

Proof Define S as in Theorem 3.13. If s e S and e > 0, there exists a g e 
C c (X) such that g(x) = s(x) except on a set of measure <e, and \g \ < ||s || ^ 
(Lusin’s theorem). Hence 


\\g — s|| p < 2€ 1/p ||s|| 00 . (1) 

Since S is dense in B(p ), this completes the proof. //// 

3.15 Remarks Let us discuss the relations between the spaces LF(R k ) (the LF- 
spaces in which the underlying measure is Lebesgue measure on R k ) and the 
space C c (R k ) in some detail. We consider a fixed dimension k. 

For every p e [1, oo] we have a metric on C c (R k ); the distance between / 
and is \\f — g\\ p . Note that this is a genuine metric, and that we do not 
have to pass to equivalence classes. The point is that if two continuous func- 
tions on R k are not identical, then they differ on some nonempty open set V , 
and m(V) > 0, since V contains a /c-cell. Thus if two members of C c (R k ) are 
equal a.e., they are equal. It is also of interest to note that in C c (R k ) the 
essential supremum is the same as the actual supremum : for / e C c (R k ) 

II /II oo = SU P I/Ml- 

xeR* 


(i) 
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If 1 < p < oo, Theorem 3.14 says that C c (R k ) is dense in E(R% and 
Theorem 3.11 shows that E(R k ) is complete. Thus E(R k ) is the completion of 
the metric space which is obtained by endowing C c (R k ) with the E-metric . 

The cases p — 1 and p = 2 are the ones of greatest interest. Let us state 
once more, in different words, what the preceding result says if p = 1 and 
k = 1 ; the statement shows that the Lebesgue integral is indeed the “ right ” 
generalization of the Riemann integral: 

If the distance between two continuous functions f and g , with compact 
supports in R 1 , is defined to be 



\f(t)-g(t)\dt 9 


(2) 


the completion of the resulting metric space consists precisely of the Lebesgue 
integrable functions on R 1 , provided we identify any two that are equal almost 
everywhere. 


Of course, every metric space S has a completion S* whose elements may be 
viewed abstractly as equivalence classes of Cauchy sequences in S (see [26], 
p. 82). The important point in the present situation is that the various E- 
completions of C c (R k ) again turn out to be spaces of functions on R k . 

The case p = oo differs from the cases p < oo. The E° -completion of C c (R k ) is 
not L°{R k ), but is C 0 (R k \ the space of all continuous functions on R k which “ vanish 
at infinity” a concept which will be defined in Sec. 3.16. Since (1) shows that the 
L°°-norm coincides with the supremum norm on C c (R k \ the above assertion 
about C 0 {R k ) is a special case of Theorem 3.17. 

3.16 Definition A complex function / on a locally compact Hausdorff space 
X is said to vanish at infinity if to every € > 0 there exists a compact set 
K c= X such that | f(x) \ < e for all x not in K. 

The class of all continuous / on X which vanish at infinity is called 
C 0 (X). 

It is clear that C c (X) c= C 0 {X\ and that the two classes coincide if X is 
compact. In that case we write C(X) for either of them. 

3.17 Theorem If X is a locally compact Hausdorff space , then C 0 (X) is the 
completion of C£X\ relative to the metric defined by the supremum norm 

ll/ll = sup \f(x)\. (1) 

x e X 


Proof An elementary verification shows that C 0 (X) satisfies the axioms of a 
metric space if the distance between / and g is taken to be \\f — g\\. We have 
to show that (a) C c (X) is dense in C 0 (X) and ( b ) C 0 (X) is a complete metric 
space. 
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Given / e C 0 (X) and 6 > 0, there is a compact set K so that \f(x)\ < e 
outside K. Urysohn’s lemma gives us a function g e C c (X) such that 
0 < g < 1 and g(x) = 1 on K. Put h = fg. Then h e C c {X) and \\f— h\\ < e. 
This proves (a). 

To prove (b), let {/„} be a Cauchy sequence in C 0 (X\ i.e., assume that 
{/„} converges uniformly. Then its pointwise limit function /is continuous. 
Given € > 0, there exists an n so that \\f n — f\\ < e/2 and there is a compact 
set K so that \f n (x)\ < e/2 outside K. Hence \f(x)\ < e outside K , and we 
have proved that / vanishes at infinity. Thus C 0 (X) is complete. //// 

Exercises 

1 Prove that the supremum of any collection of convex functions on ( a , b) is convex on (a, b) (if it is 
finite) and that pointwise limits of sequences of convex functions are convex. What can you say about 
upper and lower limits of sequences of convex functions? 

2 If is convex on (a, b) and if ^ is convex and nondecreasing on the range of (p , prove that \j/ ° (p is 
convex on (a, b). For q> > 0, show that the convexity of log (p implies the convexity of cp, but not vice 
versa. 

3 Assume that (p is a continuous real function on (a, b) such that 

Sx + yX A 1 

(pi J ^ 2 VM + 2 ^ 

for all x and y e (a, b). Prove that (p is convex. (The conclusion does not follow if continuity is omitted 
from the hypotheses.) 

4 Suppose / is a complex measurable function on X, n is a positive measure on X, and 

<p{p ) = Jj/ |, ’^= ll/ll J (0 < p < oo). 

Let E = {p: (p(p) < oo}. Assume WfW^ > 0. 

(u) If r < p < s, r 6 E, and s e E, prove that p e E. 

(i b ) Prove that log q> is convex in the interior of E and that (p is continuous on E. 

(c) By (u), E is connected. Is E necessarily open? Closed? Can E consist of a single point? Can E 
be any connected subset of (0, oo)? 

(d) If r < p < s, prove that ||/|| p < max (||/|| r , ||/|| s ). Show that this implies the inclusion 
m n E{p) c mu). 

( e ) Assume that ||/|| r < oo for some r < oo and prove that 

ll/llp-HI/IL as p— ► oo. 

5 Assume, in addition to the hypotheses of Exercise 4, that 

p(X) = 1. 

{a) Prove that ||/|| r < ||/||,if 0 <r<s< oo. 

{b) Under what conditions does it happen that 0 < r < s < oo and ||/|| r = ||/|| g < oo? 

(c) Prove that E{p) => E{p) if 0 <r < s. Under what conditions do these two spaces contain the 
same functions? 

(i d) Assume that ||/|| r < oo for some r > 0, and prove that 

lim ||/ ||p = exp | j log \f\ dp 
P -*o Ux 


if exp { — oo } is defined to be 0. 
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6 Let m be Lebesgue measure on [0, 1], and define \\f\\ p with respect to m. Find all functions <& on 
[0, oo) such that the relation 


dKlim \\f\\ p )= f(a>o f)dm 

p~*o Jo 


holds for every bounded, measurable, positive / Show first that 


c<D(x) + (1 - c)<D(l) = <D(x c ) (x > 0, 0 < c < 1). 

Compare with Exercise 5(d). 

7 For some measures, the relation r < s implies E(p) c= E{p); for others, the inclusion is reversed; and 
there are some for which E(p) does not contain E(p) if r # s. Give examples of these situations, and 
find conditions on p under which these situations will occur. 

8 If g is a positive function on (0, 1) such that g(x )— ► oo as x— ► 0, then there is a convex function h on 
(0, 1) such that h<g and h{x)— > oo as x— >0. True or false? Is the problem changed if (0, 1) is 
replaced by (0, oo) and x— ► 0 is replaced by x— ► oo? 

9 Suppose / is Lebesgue measurable on (0, 1), and not essentially bounded. By Exercise 4(e), 
H/llp— ► oo as p — ► oo. Can \\f\\ p tend to oo arbitrarily slowly? More precisely, is it true that to every 
positive function $ on (0, oo) such that 4>(p)— ► oo as p — ► oo one can find an / such that ||/|| p — ► oo as 
p— > oo, but ||/ || p < <I>(p) for all sufficiently large p? 

10 Suppose /„ e Zf(p), for n = 1, 2, 3, ..., and ||/„ — /|| p — ► 0 and /„— > g a.e., as n— ► oo. What relation 
exists between / and p? 

11 Suppose p(Q) = 1, and suppose / and g are positive measurable functions on ft such that fg > 1. 
Prove that 

| f dn ■ \ g dn^l. 

Jn Jn 

12 Suppose p(ft) = 1 and h: ft— ► [0, oo] is measurable. If 



prove that 


s/l + ~A 2 < 


J ^/l + A 2 dg £ 1 + A. 


If p is Lebesgue measure on [0, 1] and if h is continuous, h = /', the above inequalities have a simple 
geometric interpretation. From this, conjecture (for general ft) under what conditions on h equality 
can hold in either of the above inequalities, and prove your conjecture. 

13 Under what conditions on / and g does equality hold in the conclusions of Theorems 3.8 and 3.9? 
You may have to treat the cases p — 1 and p = oo separately. 

14 Suppose 1 < p < oo,/ e IS = IF(( 0, oo)), relative to Lebesgue measure, and 


F (x) = - [ f(t) dt 

x Jo 

(< a ) Prove Hardy’s inequality 


(0 < x < oo). 


p - 1 


which shows that the mapping/— ► F carries U into If. 
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ib) Prove that equality holds only iff = 0 a.e. 

(c) Prove that the constant p/ip — 1) cannot be replaced by a smaller one. 

(i d ) Iff > 0 and / e L 1 , prove that F $ L 1 . 

Suggestions: ( a ) Assume first that f > 0 and / e C c ((0, oo)). Integration by parts gives 


J *oo (* oo 

F p (x) dx= -p \ F p ~\x)xF\x) dx. 

0 Jo 

Note that xF' =f — F, and apply Holder’s inequality to J F p_1 /. Then derive the general case, (c) 
Take fix) = x ~ 1/p on [1, A\f{x) = 0 elsewhere, for large A. See also Exercise 14, Chap. 8. 

15 Suppose {tf„} is a sequence of positive numbers. Prove that 


oo 


I 

N= 1 


/i N y / n y °° 

(- y ) <( — — ) y a p 
\N h 7 Vp-7 n = 1 " 


if 1 < p < o o. Hint : If > n n+1 , the result can be made to follow from Exercise 14. This special case 
implies the general one. 

16 Prove Egoroff’s theorem: If p{X) < oo, if {/„} is a sequence of complex measurable functions 
which converges pointwise at every point of X, and if e > 0, there is a measurable set E a X, with 
p(X — E) < e, such that {/„} converges uniformly on E. 

(The conclusion is that by redefining the /„ on a set of arbitrarily small measure we can convert a 
pointwise convergent sequence to a uniformly convergent one; note the similarity with Lusin’s 
theorem.) 

Hint : Put 


S(n,fc)= n jx: \Ux) -ffx)\ 

show that p{S(n, k))— > p{X) as n — ► oo, for each k, and hence that there is a suitably increasing 
sequence {n k } such that E = Q S(n k , k ) has the desired property. 

Show that the theorem does not extend to <r-finite spaces. 

Show that the theorem does extend, with essentially the same proof, to the situation in which the 
sequence {/„} is replaced by a family {/,}, where t ranges over the positive reals; the assumptions are 
now that, for all x e X, 

(i) lim f{x) = f{x) and 

t-* oo 

(ii) t — > f{x) is continuous. 

17 id) IfO < p < oo, put y p = max (1, 2 P ! ), and show that 

I a — /? | p < y p ( | a | p + \p\ p ) 
for arbitrary complex numbers a and /?. 

ib) Suppose p is a positive measure on X, 0 < p < oo, / e L p ip),f n e L p ip), /„(*)-/(*) a.e., and 
||/„||p— ► H/ilp as n— > oo. Show that then lim II/— /„ || p = 0, by completing the two proofs that are 
sketched below. 

(i) By Egoroff’s theorem, X = A u B in such a way that ] A \f\ p <^ MB) < °°> an d /„— ►/ 
uniformly on B. Fatou’s lemma, applied to f B \f n | p , leads to 

lim sup J* | /„ | p dp < e. 

(ii) Put h n = y p i\f\ p + \f„ | p ) - I/- f„ \ p , and use Fatou’s lemma as in the proof of Theorem 
L34 

(c) Show that the conclusion of ib) is false if the hypothesis \\f n \\ p —> \\f\\ p is omitted, even if 
piX) < oo. 
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18 Let \i be a positive measure on X. A sequence {/„} of complex measurable functions on X is said 
to converge in measure to the measurable function / if to every e > 0 there corresponds an N such that 

M{x: I fjx) -/(x) | >«})<« 

for all n> N. (This notion is of importance in probability theory.) Assume p(X) < oo and prove the 
following statements: 

(a) If /„(*) — ►/(*) a.e., then f n — »/ in measure. 

(b) If /„ e E(p) and ||/„ — / || p — ► 0, then f H — ♦/ in measure; here 1 < p < oo. 

(c) If/„ — >/ in measure, then { /„} has a subsequence which converges to / a.e. 

Investigate the converses of (a) and ( b ). What happens to (< a ), (b), and (c) if p{X) = oo, for 
instance, if p is Lebesgue measure on R 1 ? 

19 Define the essential range of a function / e L°°(/u) to be the set R f consisting of all complex 
numbers w such that 


M({x: l/(x) — w | < e}) > 0 

for every e > 0. Prove that R f is compact. What relation exists between the set R f and the number 

ll/L? 

Let A f be the set of all averages 

~k \ fdfi 

M£) J E 

where E eWl and n{E) > 0. What relations exist between A f and R f l Is A f always closed? Are there 
measures fj. siich that A f is convex for every / e L 00 ^)? Are there measures g. such that A f fails to be 
convex for some / £ L 00 ^)? 

How are these results affected if L°°(/u) is replaced by l}(n), for instance? 

20 Suppose q> is a real function on R l such that 

<p(j /(x) dxj < J <p(f) dx 
for every real bounded measurable /. Prove that <p is then convex. 

21 Call a metric space Y a completion of a metric space X if X is dense in Y and Y is complete. In 
Sec. 3.15 reference was made to “the” completion of a metric space. State and prove a uniqueness 
theorem which justifies this terminology. 

22 Suppose X is a metric space in which every Cauchy sequence has a convergent subsequence. Does 
it follow that X is complete? (See the proof of Theorem 3.11.) 

23 Suppose \x is a positive measure on X y p{X) < oo,/ e L°°(/u), WfW^ >0, and 

« n = jjf\"dn (»= 1 , 2 , 3 ,...). 


Prove that 


lim -^= ||/|| .. 

Of 


24 Suppose p is a positive measure,/ e E{p\ g e E(p). 
(a) If 0 < p < 1, prove that 


I 1 


\f\ p -\g\ p 



\f-g\ p dn 


that A (/, g) = j !/ — g\ p du defines a metric on Z?(/r), and that the resulting metric space is complete. 
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(b) If 1 < p < oo and \\f\\ p < R, \\g\\ p ^ R, prove that 

^ \ \f\ p — \d\ p \ dfi £ 2pR p ~ l \\f — g\\ p . 
Hint : Prove first, for x > 0, y > 0, that 




\x-y\ p 

,p\x-y\(x p ~ 1 +y p_1 ) 


if 0 < p < 1, 
if 1 < p < oo. 


Note that ( a ) and ( b ) establish the continuity of the mapping/— ► \f\ p that carries IF(fi) into 
25 Suppose (i is a positive measure on X and /: X — ►((), oo) satisfies f x f d/i = 1. Prove, for every 
E cz X with 0 < ii{E) < oo, that 


and, when 0 < p < 1, 


I, 


(log/) dn < n(E) log 




f p d f x<m i - p - 

E 

26 If/is a positive measurable function on [0, 1], which is larger, 


L> 


r 


f{x) log fix) dx or 


■f 


fis) ds log fit) dt ? 
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Inner Products and Linear Functionals 

4.1 Definition A complex vector space H is called an inner product space (or 
unitary space) if to each ordered pair of vectors x and y e H there is associ- 
ated a complex number (x, y), the so-called “ inner product ” (or “ scalar 
product ”) of x and y, such that the following rules hold: 


(a) (y, x) = (x, y). (The bar denotes complex conjugation.) 

(b) (x + y, z) = (x, z) + (y, z) if x, y, and z e H. 

(c) (ax, y) = a(x, y) if x and y e H and a is a scalar. 

(< d ) (x, x) > 0 for all x e H. 

(e) (x, x) = 0 only if x = 0. 

Let us list some immediate consequences of these axioms : 

(c) implies that (0, y) = 0 for all y e H. 

(b) and (c) may be combined into the statement: For every y e H, the 
mapping x — * (x, y) is a linear functional on H. 

(a) and (c) show that (x, ay) = a(x, y). 

(a) and ( b ) imply the second distributive law: 

(z, x + y) = (z, x) + (z, y). 

By (d), we may define ||x||, the norm of the vector x e H, to be the non- 
negative square root of (x, x). Thus 

(/) II X II 2 = (x, x). 


76 
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4.2 The Schwarz Inequality The properties 4.1 (a) to (d) imply that 

|(x, y)| < 11*11 \\y\\ 


for all x and y e H. 

Proof Put A = ||x|| 2 , £ = |(x, y)|, and C = ||y|| 2 . There is a complex 
number a such that | a | = 1 and a (y, x) = B. For any real r, we then have 

(x — my, x — my) = (x, x) — m(y , x) — ra(x, y) + r 2 (y, y). (1) 

The expression on the left is real and not negative. Hence 

A — 2Br + Cr 2 > 0 (2) 

for every real r. If C = 0, we must have B — 0; otherwise (2) is false for large 
positive r. If C > 0, take r — B/C in (2), and obtain B 2 < AC. //// 

4.3 The Triangle Inequality For x and y e H, we have 

l|x + y||<||x|| + ||y||. 

Proof By the Schwarz inequality, 

II* + yll 2 = (x + y, X + y) = (x, x) + (x, y) + (y, x) + ( y , y) 

<||x|| 2 + 2||x||||y|| + ||y|| 2 =(||x|| + ||y||) 2 . //// 

4.4 Definition It follows from the triangle inequality that 

II x - z|| < II x - yll + II y - z|| (x, y, z e H). (1) 

If we define the distance between x and y to be ||x — y||, all the axioms for a 
.metric space are satisfied; here, for the first time, we use part (e) of Definition 
4.1. 

Thus H is now a metric space. If this metric space is complete , i.e., if every 
Cauchy sequence converges in H , then H is called a Hilbert space . 

Throughout the rest of this chapter, the letter H will denote a Hilbert space. 

4.5 Examples 

(a) For any fixed n, the set C n of all n-tuples 

where £ ls ..., are complex numbers, is a Hilbert space if addition and 
scalar multiplication are defined componentwise, as usual, and if 

(x, y) = Z ZjVj (y = On, ■ ■ ■ , tin))- 

j=i 
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(b) If \i is any positive measure, l3(p) is a Hilbert space, with inner product 

(/ a) = J* fa da- 

The integrand on the right is in 13(g), by Theorem 3.8, so that (/, g) is 
well defined. Note that 

ll/ll =(/,/) 1/2 = {|j/I 2 ^} 1/2 = II/II 2 . 

The completeness of 13(g) (Theorem 3.11) shows that 13(g) is indeed a 
Hilbert space. [We recall that l}(g) should be regarded as a space of 
equivalence classes of functions; compare the discussion in Sec. 3.10.] 

For H = 13(g), the inequalities 4.2 and 4.3 turn out to be special 
cases of the inequalities of Holder and Minkowski. 

Note that Example (a) is a special case of ( b ). What is the measure in 

(a)? 

(c) The vector space of all continuous complex functions on [0, 1] is an 
inner product space if 

(/. 9) = [/(%« dt 

but is not a Hilbert space. 

4.6 Theorem For any fixed y e H, the mappings 

(x, y ), x-> ( y , x), x-> ||x|| 

are continuous functions on H. 

Proof The Schwarz inequality implies that 

I (*i> y) - (* 2 . y) I = I (*i - x 2 , y) | < ||xi - x 2 1| ||.y||, 

which proves that x—> (x, y) is, in fact, uniformly continuous, and the same is 
true for x—> (y, x). The triangle inequality ||x x || < ||x x — x 2 || 4- ||x 2 || yields 

11^1 II - II ^2 II <11^1 “ X 2 \\, 

and if we interchange x x and x 2 we see that 

| ll*i|| - l|x 2 || I < ||x x - x 2 || 

for all x x and x 2 e H. Thus x— ► ||x|| is also uniformly continuous. //// 

4.7 Subspaces A subset M of a vector space V is called a subspace of V if M is 
itself a vector space, relative to the addition and scalar multiplication which are 
defined in V. A necessary and sufficient condition for a set M a V to be a sub- 
space is that x 4 - y e M and ax e M whenever x and yeM and a is a scalar. 
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In the vector space context, the word “subspace” will always have this 
meaning. Sometimes, for emphasis, we may use the term “linear subspace” in 
place of subspace. 

For example, if F is the vector space of all complex functions on a set S 9 the 
set of all bounded complex functions on S is a subspace of F, but the set of all 
f e V with | f(x) | < 1 for all x e S is not. The real vector space R 3 has the follow- 
ing subspaces, and no others: (a) R 3 , (b) all planes through the origin 0, (c) all 
straight lines through 0, and (d) {0}. 

A closed subspace of H is a subspace that is a closed set relative to the topol- 
ogy induced by the metric of H. 

Note that if M is a subspace of if, so is its closure M. To see this, pick x and y 
in M and let a be a scalar. There are sequences {x„} and {y„} in M that converge 
to x and y, respectively. It is then easy to verify that x n + y n and ax„ converge to 
* + y and ax, respectively. Thus x + y e M and ax e Af. 

4.8 Convex Sets A set E in a vector space F is said to be convex if it has the 
following geometric property: Whenever x e £, y e £, and 0 < t < 1, the point 

2 , = (1 - t)x + ty 

also lies in E. As t runs from 0 to 1, one may visualize z t as describing a straight 
line segment in F, from x to y. Convexity requires that E contain the segments 
between any two of its points. 

It is clear that every subspace of F is convex. 

Also, if E is convex, so is each of its translates 

E + x = {y + x: y e E}. 


4.9 Orthogonality If (x, y) — 0 for some x and y g if , we say that x is orthogonal 
to y, and sometimes write x _L y. Since (x, y) = 0 implies (y, x) = 0, the relation ± 
is symmetric. 

Let x 1 denote the set of all y e H which are orthogonal to x; and if M is a 
subspace of if, let Af 1 be the set of all y e H which are orthogonal to every 
x g Af. 

Note that x 1 is a subspace of if, since x ± y and x ± y' implies x _L (y + y') 
and x _L ay. Also, x 1 is precisely the set of points where the continuous function 
y— ► (x, y) is 0. Hence x 1 is a closed subspace of H. Since 

m x = n x 1 , 

x e M 

M 1 is an intersection of closed subspaces, and it follows that M 1 is a closed 
subspace of H. 

4.10 Theorem Every nonempty , closed , convex set E in a Hilbert space H con- 
tains a unique element of smallest norm. 



80 REAL AND COMPLEX ANALYSIS 


In other words, there is one and only one x 0 e E such that ||x 0 || < ||x|| for 
every x e E. 


Proof An easy computation, using only the properties listed in Definition 
4.1, establishes the identity 

II X + >>ll 2 + II X - y \\ 2 = 2MI 2 + 2||.y|| 2 (xandyetf). (1) 

This is known as the parallelogram /aw: If we interpret ||x|| to be the length 
of the vector x, (1) says that the sum of the squares of the diagonals of a 
parallelogram is equal to the sum of the squares of its sides, a familiar propo- 
sition in plane geometry. 

Let S = inf {|| x || : x e E}. For any x and y e E, we apply (1) to jx and \y 
and obtain 


ill*- 


■yll 


2 = ilMI 2 + ill 


x + y 


( 2 ) 


Since E is convex, (x + y)/2 e E. Hence 

II x — y|| 2 < 2||x|| 2 + 2||y|| 2 — 4<5 2 (x and y e E). (3) 

If also ||x|| = ||y|| = <5, then (3) implies x = y, and we have proved the unique- 
ness assertion of the theorem. 

The definition of d shows that there is a sequence {y„} in E so that 
lljFnll — ^ ^ as n — > oo. Replace x and y in (3) by y n and y m . Then, as n— ► oo and 
ra— ► oo, the right side of (3) will tend to 0. This shows that {y n } is a Cauchy 
sequence. Since H is complete, there exists an x 0 e H so that >x 0 , i.e., 
|| y n — x 0 1| — > 0, as n— ► oo. Since y n e E and E is closed, x 0 e E. Since the 
norm is a continuous function on H (Theorem 4.6), it follows that 

||x 0 || = lim \\yj = //// 

n~* oo 


4.1 1 Theorem Let M be a closed subspace of a Hilbert space H. 

(a) Every x e H has then a unique decomposition 

x = Px + Qx 

into a sum of Px e M and Qx e M 1 . 

(b) Px and Qx are the nearest points to x in M and in M 1 , respectively. 

(c) The mappings P: //— ► M and Q : //— ► M 1 are linear. 

(d) \\x\\ 2 = \\Px\\ 2 + ||Qx|| 2 . 

Corollary If M ^ H, then there exists y e H, y ^ 0, such that y _L M. 
P and Q are called the orthogonal projections of H onto M and M 1 . 
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Proof As regards the uniqueness in (a), suppose that x' + y' = x" + y" for 
some vectors x', x" in M and /, y" in M 1 . Then 

x' — x" = y" — y'. 

Since x' — x" e M, y" — y' e M 1 , and M n M 1 = {0} [an immediate conse- 
quence of the fact that (x, x) = 0 implies x = 0], we have x" = x', y" = y'. 

To prove the existence of the decomposition, note that the set 

x + M = {x + y: y e M] 

is closed and convex. Define Qx to be the element of smallest norm in 
x + M; this exists, by Theorem 4.10. Define Px = x — Qx. 

Since Qx e x + A/, it is clear that Px e M. Thus P maps H into M. 

To prove that Q maps H into M 1 we show that (Qx, y) = 0 for all y e M. 
Assume ||y|| = 1, without loss of generality, and put z = Qx. The minimizing 
property of Qx shows that 

(z, z) = ||z|| 2 < || z - ay || 2 = (z - ay, z - ay) 

for every scalar a. This simplifies to 

0 < —a (y, z) — a (z, y) + aa. 

With a = (z, y ), this gives 0 < — | (z, y) | 2 , so that (z, y) = 0. Thus Qx e M 1 . 
We have already seen that Px e M. If y e M, it follows that 

II* - yll 2 = lie* + (Px - y) II 2 = IIG*II 2 + II Px - y \\ 2 

which is obviously minimized when y = Px. 

We have now proved (a) and (b). If we apply (a) to x, to y, and to 
ax + 0y, we obtain 

P(ax + Py) - a Px - pPy = aQx + PQy - Q{ ax + Py). 

The left side is in M, the right side in M 1 . Hence both are 0, so P and Q are 
linear. 

Since Px _L Qx, (d) follows from (a). 

To prove the corollary, take x e H, x $ M, and put y = Qx. Since 
Px e M, x 7 ^ Px, hence y = x — Px ^ 0. //// 

We have already observed that x->(x, y) is, for each y e H, a continuous 
linear functional on H. It is a very important fact that all continuous linear 
functionals on H are of this type. 

4.12 Theorem If L is a continuous linear functional on H, then there is a 
unique y e H such that 


Lx = (x, y) (x e H). 


(i) 
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Proof If Lx = 0 for all x, take y = 0. Otherwise, define 


M = {x: Lx = 0}. (2) 

The linearity of L shows that M is a subspace. The continuity of L shows 
that M is closed. Since Lx ^ 0 for some x e H, Theorem 4.11 shows that Af 1 
does not consist of 0 alone. 

Hence there exists z e Af 1 , with ||z|| = 1. Put 

u = (Lx)z — (Lz)x. (3) 

Since Lu = ( Lx)(Lz ) — (Lz)(Lx) = 0, we have u e M. Thus (w, z) = 0. This 
gives 


Lx = (LxXz, z) = (Lz)(x, z). (4) 

Thus (1) holds with y = az, where a = Lz. 

The uniqueness of y is easily proved, for if (x, y) = (x, y') for all x e H, set 
z = y — y'; then (x, z) = 0 for all x e H; in particular, (z, z) = 0, hence z = 0. 

//// 


Orthonormal Sets 


4.13 Definitions If F is a vector space, if x 1? . . . , x k e F, and if c l5 , c fc are 

scalars, then c ^ + ••• + c fc x fc is called a /inear combination of x l5 ..., x k . 
The set {x l5 . . . , xj is called independent if c t x t + • • • + c k x k = 0 implies that 
c t = • • • = c k = 0. A set S c= V is independent if every finite subset of S is 
independent. The set [5] of all linear combinations of all finite subsets of S 
(also called the set of all finite linear combinations of members of 5) is clearly 
a vector space; [5] is the smallest subspace of V which contains S; [5] is 
called the span of 5, or the space spanned by 5. 

A set of vectors u a in a Hilbert space H , where a runs through some 
index set A, is called orthonormal if it satisfies the orthogonality relations 
(u a , Up) = 0 for all a ^ /?, a e A, and ft e A, and if it is normalized so that 
|| wj = 1 for each a e A. In other words, {u a } is orthonormal provided that 



if a = fi, 
if a 


( 1 ) 


If {w a : a e A} is orthonormal, we associate with each x e H a complex 
function x on the index set A, defined by 


x(a) = (x, u a ) (a e A). (2) 

One sometimes calls the numbers x(a) the Fourier coefficients of x, relative to 
the set {wj. 

We begin with some simple facts about finite orthonormal sets. 
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4.14 Theorem Suppose that {u a : a e A} is an orthonormal set in H and that F 
is a finite subset of A. Let M F be the span of {w a : a e F}. 

(a) If (p is a complex function on A that is 0 outside F, then there is a 
vector y e M F , namely 


y = Z <p( a K 

a e F 

that has y(a) = cp( a) for every a e A. Also , 

( 1 ) 

\\y\\ 2 = I M*)l 2 . 

a e F 

(b) If x e H and 

( 2 ) 

M*) = z x(<x)u x 

ace F 

then 

( 3 ) 

II x - Sfix ) II < || x - s|| 

for every s e M F , except for s = s F (x), and 

( 4 ) 

Z |x(a)| 2 < l|x|| 2 . 

( 5 ) 


ace F 


Proof Part (a) is an immediate consequence of the orthogonality relations 
4.13(1). 

In the proof of (b), let us write s F in place of s^x), and note that s^a) = 
x(a) for all a e F. This says that (x — s F ) JL u a if a e F, hence (x — s F ) _L 
(s F — s) for every seM f , and therefore 

\\x - s|| 2 = II (x - Sp) + (Sp - s)|| 2 = II x - s F || 2 + ||s F - s|| 2 . (6) 

This gives (4). With s = 0, (6) gives ||s F || 2 < ||x|| 2 , which is the same as (5), 
because of (2). //// 

The inequality (4) states that the “partial sum” s F (x) of the “Fourier series” 
£ x(oc)u a of x is the unique best approximation to x in M F , relative to the metric 
defined by the Hilbert space norm. 

4.15 We want to drop the finiteness condition that appears in Theorem 4.14 
(thus obtaining Theorems 4.17 and 4.18) without even restricting ourselves to sets 
that are necessarily countable. For this reason it seems advisable to clarify the 
meaning of the symbol £ ag 4 <p( a) when a ranges over an arbitrary set A. 

Assume 0 < (p( a) < oo for each a e A. Then 

E <?(«) (*) 

a e A 

denotes the supremum of the set of all finite sums <p(a t ) + • • • + <p(a„), where 
a n are distinct members of A. 
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A moment’s consideration will show that the sum (1) is thus precisely the 
Lebesgue integral of (p relative to the counting measure p on A. 

In this context one usually writes f p (A) for LF(p). A complex function q> with 
domain A is thus in £ 2 {A) if and only if 

E l#*)l 2 < co. (2) 

ae A 

Example 4.5(b) shows that tf 2 (A) is a Hilbert space, with inner product 

( <P > <A) = Z <p(o#(a)- (3) 

a e A 

Here, again, the sum over A stands for the integral of (p\f with respect to the 
counting measure; note that (pif e ^(A) because cp and ij/ are in £ 2 (A). 

Theorem 3.13 shows that the functions cp that are zero except on some finite 
subset of A are dense in f 2 (A). 

Moreover, if <p e / 2 (A), then {a e A: cp( a) ^ 0} is at most countable. For if A n 
is the set of all a where | <p(a ) | > 1/n, then the number of elements of Ai s at most 

Z I n< p(°o 1 2 < « 2 E k(«) I 2 - 

a e A n a e A 

Each A n (n = 1, 2, 3, . . .) is thus a finite set. 

The following lemma about complete metric spaces will make it easy to pass 
from finite orthonormal sets to infinite ones. 

4.16 Lemma Suppose that 

{a) X and Y are metric spaces , X is complete, 

(b) /: X—> Y is continuous, 

(c) X has a dense subset X 0 on which f is an isometry, and 

(d) f(X 0 ) is dense in Y. 

Then f is an isometry of X onto Y. 

The most important part of the conclusion is that / maps X onto all of Y. 
Recall that an isometry is simply a mapping that preserves distances. Thus, 
by assumption, the distance between f(x x ) and f(x 2 ) in Y is equal to that between 
Xj and x 2 in X, for all points x 1} x 2 in X 0 . 

Proof The fact that / is an isometry on X is an immediate consequence of 
the continuity of f since X 0 is dense in X. 

Pick y e Y. Since f(X 0 ) is dense in Y, there is a sequence {x n } in X 0 such 
that f(x n )—> y as n— > oo. Thus {f(x n )} is a Cauchy sequence in Y. Since /is an 
isometry on X 0 , it follows that {x„} is also a Cauchy sequence. The com- 
pleteness of X implies now that {x„} converges to some x e X, and the con- 
tinuity of / shows that f(x) = lim f(x n ) = y. //// 



ELEMENTARY HILBERT SPACE THEORY 85 


4.17 Theorem Let {u a : oc e A) be an orthonormal set in H , and let P be the 
space of all finite linear combinations of the vectors u a . 

The inequality 


Z l*(a)| 2 < ||x|| 2 (1) 

a e A 

holds then for every x e H, and x— ► x is a continuous linear mapping of H onto 
f 2 (A) whose restriction to the closure P of P is an isometry of P onto f 2 (A). 

Proof Since the inequality 4.14(5) holds for every finite set F cz A, we have 
(1), the so-called Bessel inequality. 

Define / on H by /(x) = x. Then (1) shows explicitly that / maps H into 
*f 2 (A). The linearity of /is obvious. If we apply (1) to x — y we see that 

11/00 -/Mil 2 = II y - *112 ^ II y - *11- 

Thus /is continuous. Theorem 4.14(a) shows that /is an isometry of P onto 
the dense subspace of <f 2 (A) consisting of those functions whose support is a 
finite set F <= A. The theorem follows therefore from Lemma 4.16, applied 
with X = P, X 0 = P, Y = f 2 (A); note that P, being a closed subset of the 
complete metric spate H , is itself complete. //// 

The fact that the mapping x— > x carries H onto f 2 (A) is known as the Riesz- 
Fischer theorem . 

4.18 Theorem Let {u a : a e A} be an orthonormal set in H. Each of the follow- 
ing four conditions on {u a } implies the other three : 

(i) {w a } is a maximal orthonormal set in H. 

(ii) The set P of all finite linear combinations of members of {w a } is dense in H. 

(iii) The equality 


Z |x(a)| 2 = ||x|| 2 


holds for every x e H. 

(iv) The equality 

Z *( a )Ka) = (x, y) 

a e A 

holds for all x e H and y e H. 

The last formula is known as ParsevaVs identity. Observe that x and y are in 
f 2 (A\ hence xy is in ^(A), so that the sum in (iv) is well defined. Of course, (iii) is 
the special case x = y of (iv). 

Maximal orthonormal sets are often called complete orthonormal sets or 
orthonormal bases. 
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Proof To say that {u a } is maximal means simply that no vector of H can be 
adjoined to {u a } in such a way that the resulting set is still orthonormal. This 
happens precisely when there is no x ^ 0 in H that is orthogonal to every u a . 

We shall prove that (i)— ► (ii) — ► (iii) — ► (iv) — ► (i). 

If P is not dense in H , then its closure P is not all of H , and the corollary 
to Theorem 4.11 implies that P 1 contains a nonzero vector. Thus {u a } is not 
maximal when P is not dense, and (i) implies (ii). 

If (ii) holds, so does (iii), by Theorem 4.17. 

The implication (iii) — ► (i v) follows from the easily proved Hilbert space 
identity (sometimes called the “ polarization identity ”) 

4(x, y) = || x + )>|| 2 - ||x - >>|| 2 + i||x + O' II 2 - i||x - O’ II 2 

which expresses the inner product (x, y) in terms of norms and which is 
equally valid with 3c, y in place of x, y, simply because tf 2 (A) is also a Hilbert 
space. (See Exercise 19 for other identities of this type.) Note that the sums in 

(iii) and (iv) are ||x||| and (x, y), respectively. 

Finally, if (i) is false, there exists u ^ 0 in H so that (w, u a ) — 0 for all 
a e A. If x = y = w, then (x, y) = ||w|| 2 > 0 but x(a) = 0 for all a e A, hence 

(iv) fails. Thus (iv) implies (i), and the proof is complete. //// 

4.19 Isomorphisms Speaking informally, two algebraic systems of the same 
nature are said to be isomorphic if there is a one-to-one mapping of one onto the 
other which preserves all relevant properties. For instance, we may ask whether 
two groups are isomorphic or whether two fields are isomorphic. Two vector 
spaces are isomorphic if there is a one-to-one linear mapping of one onto the 
other. The linear mappings are the ones which preserve the relevant concepts in a 
vector space, namely, addition and scalar multiplication. 

In the same way, two Hilbert spaces H x and H 2 are isomorphic if there is a 
one-to-one linear mapping A of H x onto H 2 which also preserves inner products: 
(Ax, Ay) = (x, y) for all x and y e H v Such a A is an isomorphism (or, more 
specifically, a Hilbert space isomorphism) of onto H 2 . Using this terminology, 
Theorems 4.17 and 4.18 yield the following statement: 

If {u a : (x e A} is a maximal orthonormal set in a Hilbert space H , and ifx( a) = 
(x, w a ), then the mapping x — ► x is a Hilbert space isomorphism of H onto £ 2 {A). 

One can prove (we shall omit this) that tf 2 (A) and f 2 (B) are isomorphic if and 
only if the sets A and B have the same cardinal number. But we shall 'prove that 
every nontrivial Hilbert space (this means that the space does not consist of 0 
alone) is isomorphic to some tf 2 (A), by proving that every such space contains a 
maximal orthonormal set (Theorem 4.22). The proof will depend on a property of 
partially ordered sets which is equivalent to the axiom of choice. 

4.20 Partially Ordered Sets A set ^ is said to be partially ordered by a binary 
relation < if 
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(a) a < b and b < c implies a < c. 

(b) a < a for every ae^. 

(c) a < b and b < a implies a = b. 


A subset J of a partially ordered set & is said to be totally ordered (or lin- 
early ordered) if every pair a 9 b e £ satisfies either a < b or b < a. 

For example, every collection of subsets of a given set is partially ordered by 
the inclusion relation <= . 

To give a more specific example, let & be the collection of all open subsets of 
the plane, partially ordered by set inclusion, and let J be the collection of all 
open circular discs with center at the origin. Then J c= Q is totally ordered by 
<= , and i is a maximal totally ordered subset of This means that if any 
member of & not in J is adjoined to J, the resulting collection of sets is no 
longer totally ordered by c= . 


4.21 The Hausdorff Maximally Theorem Every nonempty partially ordered 
set contains a maximal totally ordered subset. 


This is a consequence of the axiom of choice and is, in fact, equivalent to it; 
another (very similar) form of it is known as Zorn’s lemma. We give the proof in 
the Appendix. 

If now H is a nontrivial Hilbert space, then there exists a u e H with ||u|| = 1, 
so that there is a nonempty orthonormal set in H. The existence of a maximal 
orthonormal set is therefore a consequence of the following theorem : 


4.22 Theorem Every orthonormal set B in a Hilbert space H is contained in a 
maximal orthonormal set in H. 


Proof Let & be the class of all orthonormal sets in H which contain the 
given set B. Partially order & by set inclusion. Since 5 e ^ ^ 0. Hence 
& contains a maximal totally ordered class Q. Let S be the union of all 
members of Q. It is clear that B <= S. We claim that S is a maximal orthonor- 
mal set : 

If and u 2 e S, then e and u 2 e A 2 for some A± and A 2 e Q. 
Since Q is total ordered, A t a A 2 (or A 2 a A x ), so that u l e A 2 and u 2 e A 2 . 
Since A 2 is orthonormal, (u l9 u 2 ) = 0 if ^ u 2 , (u l9 u 2 ) = 1 if = u 2 . Thus 
S is an orthonormal set. 

Suppose S is not maximal. Then S is a proper subset of an orthonormal 
set S*. Clearly, S* £ Q, and S* contains every member of Q. Hence we may 
adjoin S* to Q and still have a total order. This contradicts the maximality 
of Q. //// 
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Trigonometric Series 

4.23 Definitions Let T be the unit circle in the complex plane, i.e., the set of 
all complex numbers of absolute value 1. If F is any function on T and if /is 
defined on R 1 by 


m = F{e% (1) 

then / is a periodic function of period 2n. This means that f(t + 2 n) = f(t) for 
all real t. Conversely, if/ is a function on JR 1 , with period 2n , then there is a 
function F on T such that (1) holds. Thus we may identify functions on T 
with 27r-periodic functions on JR 1 ; and, for simplicity of notation, we shall 
sometimes write f(t) rather than f(e l % even if we think of / as being defined 
on T. 

With these conventions in mind, we define L P (T), for 1 < p < oo, to be 
the class of all complex, Lebesgue measurable, 27i-periodic functions on R 1 
for which the norm 


11 / 11 ,= 



( 2 ) 


is finite. 

In other words, we are looking at where \i is Lebesgue measure on 
[0, 27c] (or on T), divided by 2n. L°°(T) will be the class of all 27r-periodic 
members of L 00 ^ 1 ), with the essential supremum norm, and C(T) consists of 
all continuous complex functions on T (or, equivalently, of all continuous, 
complex, 27r-periodic functions on R 1 ), with norm 

II /II oo = sup | /(t) | , (3) 

t 

The factor l/(27c) in (2) simplifies the formalism we are about to develop. 
For instance, the L p -norm of the constant function 1 is 1. 

A trigonometric polynomial is a finite sum of the form 

N 

f(t) = a 0 + £ ( a n cos nt + b n sin nt) ( t e R 1 ) (4) 

n = 1 

where a 0 , a l9 . . . , a N and b u . . . , b N are complex numbers. On account of the 
Euler identities, (4) can also be written in the form 

m= £ c„e iM (5) 

n = - N 

which is more convenient for most purposes. It is clear that every trigono- 
metric polynomial has period 2 n. 
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We shall denote the set of all integers (positive, zero, and negative) by Z, 
and put 

Kit) = e 1 "' (n e Z). (6) 

If we define the inner product in L 2 (T) by 

if 9) = ^ J /WsW ^ (7) 

[note that this is in agreement with (2)], an easy computation shows that 


u m ) = 


e i{n ~ m)t dt 


1 if n = m, 
0 if n 7^ m. 


Thus {u„:ne Z} is an orthonormal set in L 2 (T), usually called the trigono- 
metric system. We shall now prove that this system is maximal, and shall then 
derive concrete versions of the abstract theorems previously obtained in the 
Hilbert space context. 


4.24 The Completeness of the Trigonometric System Theorem 4.18 shows that the 
maximality (or completeness) of the trigonometric system will be proved as soon 
as we can show that the set of all trigonometric polynomials is dense in L 2 (T). 
Since C(T) is dense in L 2 (T), by Theorem 3.14 (note that T is compact), it suffices 
to show that to every f e C(T) and to every e > 0 there is a trigonometric poly- 
nomial P such that ||/ — P || 2 < e. Since ||gf|| 2 < Halloo for every g e C(T), the esti- 
mate ||/ — P\\ 2 < € will follow from \\f — PW^ < e, and it is this estimate which 
we shall prove. 

Suppose we had trigonometric polynomials Q u Q 2 , Q3, . . with the follow- 
ing properties : 

(a) Q k (t) > 0 for t e JR 1 . 

(b) ~ J Q k (t) dt=\. 

(c) Ifr] k (d) = sup {Q k (t): d < \ t \ < n}, then 

lim rj k (S) = 0 

k ~* 00 

for every S > 0. 

Another way of stating ( c ) is to say: for every S > 0, Q k (t )— » 0 uniformly on 

[ — 7T, — <5] U IS, 7l], 

To each/ e C(T) we associate the functions P k defined by 

PM = £ I* fit - ds ( k = 1, 2, 3, . . .). (1) 



90 REAL AND COMPLEX ANALYSIS 


If we replace s by — s (using Theorem 2.20(e)) and then by s — r, the periodicity of 
/ and Q k shows that the value of the integral is not affected. Hence 

P k (t) = 2- f(s)Q k (t - s) ds (k = 1, 2, 3, . . .)• (2) 

Since each Q k is a trigonometric polynomial, Q k is of the form 

N k 

Qk(t)= X a n, k e in \ (3) 

n=~N k 

and if we replace t by t — s in (3) and substitute the result in (2), we see that each 
P k is a trigonometric polynomial. 

Let 6 > 0 be given. Since / is uniformly continuous on T, there exists a S > 0 
such that \f(t)—f(s)\<e whenever 1 1 — s \ < S. By (b\ we have 

pm -/(f) = L J' {/( f - s) -/(t)}e*( S ) ds, 

and (a) implies, for all r, that 

I fit - s) -fit) | e*(s) ds= A t +A 2 , 


\p k it)-m\ < 


2n 


where is the integral over [ — <5, <5] and A 2 is the integral over [ — n 9 — <5] u 
[^, 7t]. In Xj, the integrand is less than €Q k (s), so A t < e, by ( b ). In A 2 , we have 
Qkis) < hence 

A 2 <2\\f\\ x -r, k (d)<€ (4) 

for sufficiently large /c, by (c). Since these estimates are independent of r, we have 
proved that 

lim ||/- PJ oo=0. (5) 

fc -00 


It remains to construct the Q k . This can be done in many ways. Here is a 
simple one. Put 


GfcM — c k 


1 + COS t 


( 6 ) 


where c k is chosen so that (b) holds. Since (a) is clear, we only need to show (c). 
Since Q k is even, (b) shows that 


•iff 


4- cos t 


c k 

dt> — 

71 


1 -f cos t) k . . 2 c k 

sin t dt = 


n(k 4- 1) 


Since is decreasing on [0, 7c], it follows that 


Q k (t) < Q k (S) < 


n(k 4- 1) ^1 4- cos S^ k 


(0 < S < 1 1 1 < n). 


(7) 
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This implies (c), since 1 + cos <5<2ifO<<5<7i. 

We have proved the following important result: 

4.25 Theorem If f e C(T) and e > 0, there is a trigonometric polynomial P 
such that 


I fit) - P(t) I < e 


for every real t. 

A more precise result was proved by Fejer (1904): The arithmetic means of the 
partial sums of the Fourier series of any f e C(T) converge uniformly to f For a 
proof (quite similar to the above) see Theorem 3.1 of [45], or p. 89 of [36], vol. I. 

4.26 Fourier Series For any / e L*(T), we define the Fourier coefficients of /by the 
formula 

/(») = ^f /<*-* (»eZ), (1) 

where, we recall, Z is the set of all integers. We thus associate with each f e I}(T) 
a function / on Z. The Fourier series of /is 

£ (2) 


and its partial sums are 

SnW = £ hn)e iM (AT = 0, 1,2,...). (3) 

~N 

Since L 2 (T) c Lf(T), (1) can be applied to every / e L 2 (T). Comparing the defi- 
nitions made in Secs. 4.23 and 4.13, we can now restate Theorems 4.17 and 4.18 
in concrete terms : 

The Riesz-Fischer theorem asserts that if {c n } is a sequence of complex 
numbers such that 


£ I c„ I 2 < OO, (4) 

n— — oo 

then there exists an f e L 2 (T) such that 

° n= h r— (nez) - (5) 

The Parseval theorem asserts that 

£ f(n)g(n) = / f f(t)g{t) dt (6) 

n = - oo ^ J-n 
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whenever / e L 2 (T) and g e L 2 (T); the series on the left of (6) converges absolu- 
tely; and if s N is as in (3), then 


lim II/- s*|| 2 =0, (7) 

N -* co 

since a special case of (6) yields 

II/- % 111= Z l/(«)l 2 . (8) 

l"l>N 

Note that (7) says that every / e L 2 (T) is the L 2 -limit of the partial sums of its 
Fourier series; i.e., the Fourier series of / converges to /, in the L 2 -sense. Pointwise 
convergence presents a more delicate problem, as we shall see in Chap. 5. 

The Riesz-Fischer theorem and the Parseval theorem may be summarized by 
saying that the mapping /<-»/is a Hilbert space isomorphism of L 2 (T) onto / 2 (Z). 

The theory of Fourier series in other function spaces, for instance in l}(T\ is 
much more difficult than in L 2 (T), and we shall touch only a few aspects of it. 

Observe that the crucial ingredient in the proof of the Riesz-Fischer theorem 
is the fact that L 2 is complete. This is so well recognized that the name “Riesz- 
Fischer theorem ” is sometimes given to the theorem which asserts the complete- 
ness of L 2 , or even of any IF. 

Exercises 

In this set of exercises, H always denotes a Hilbert space. 

1 If M is a closed subspace of H, prove that M = ( M ± ) ± . Is there a similar true statement for sub- 
spaces M which are not necessarily closed ? 

2 Let {x„: n= 1, 2, 3, ...} be a linearly independent set of vectors in H. Show that the following 
construction yields an orthonormal set {u n } such that {x l5 x N } and { u lt . .., u N } have the same 
span for all N. 

Put u t = Xj/Hxj ||. Having u n ^ l define 

n - 1 

V „ = X n- Z (*«’ W iK> U n = V JW V nl 

i=l 

Note that this leads to a proof of the existence of a maximal orthonormal set in separable 
Hilbert spaces which makes no appeal to the Hausdorff maximality principle. (A space is separable if 
it contains a countable dense subset.) 

3 Show that H{T) is separable if 1 < p < oo, but that L°°(r) is not separable. 

4 Show that H is separable if and only if H contains a maximal orthonormal system which is at most 
countable. 

5 If M = {x: Lx = 0}, where L is a continuous linear functional on H , prove that M 1 is a vector 
space of dimension 1 (unless M = H). 

6 Let {u n } {n = 1, 2, 3, . . .) be an orthonormal set in H. Show that this gives an example of a closed 
and bounded set which is not compact. Let Q be the set of all x e H of the form 

oo 

* =Z C n u n 


^ where | c n \ < 


Prove that Q is compact. ( Q is called the Hilbert cube.) 



ELEMENTARY HILBERT SPACE THEORY 93 


More generally, let {<5„} be a sequence of positive numbers, and let S be the set of all x e H of 
the form 


* = £ c„ (where | c J < <5„). 

1 

Prove that 5 is compact if and only if Y? S 2 < oo. 

Prove that H is not locally compact. 

7 Suppose {tf„} is a sequence of positive numbers such that Y a n b n < °° whenever b n > 0 and 
Y b 2 < oo. Prove that Y a l < °°- 

Suggestion: If Y a n = 00 l^en there are disjoint sets E k ( k = 1, 2, 3, .. .) so that 

I al > I. 

ne Ek 

Define b n so that b n = c k a n for n e E k . For suitably chosen c k , Y a n b„ = oo although Y bl < oo. 

8 If H l and H 2 are two Hilbert spaces, prove that one of them is isomorphic to a subspace of the 
other. (Note that every closed subspace of a Hilbert space is a Hilbert space.) 

9 If A c: [0, In'] and A is measurable, prove that 

lim cos nx dx = lim sin nx dx = 0. 

n-> oo JA w-oo JA 

10 Let < n 2 < n 3 < • • be positive integers, and let E be the set of all x e [0, 2n] at which 
{sin n k x} converges. Prove that m(£) = 0. Hint : 2 sin 2 a = 1 — cos 2a, so sin n k x— > ± 1/^/2 a.e. on E, 
by Exercise 9. 

11 Find a nonempty closed set E in l3(T) that contains no element of smallest norm. 

12 The constants c k in Sec. 4.24 were shown to be such that k~ l c k is bounded. Estimate the relevant 
integral more precisely and show that 


0 < lim k 1/2 c k < oo. 


13 Suppose / is a continuous function on/? 1 , with period 1. Prove that 


lim 77 E /("«)= [ /(*) dt 

N-> oo N n=l Jo 


for every irrational real number a. Hint : Do it first for 

f{t) = exp (27«7cr), k = 0, ±1, ±2, — 

14 Compute 


min | x 3 — a 
a, b, c J- 1 


— bx — cx 2 1 2 dx 


and find 


max 


f x 3 #(x) dx , 


where g is subject to the restrictions 


J g{x) dx = J xg{x) dx = J x 2 #(x) dx = 0; J | g{x)\ 2 dx = 1. 
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15 Compute 

f 00 

min | x 3 — a — bx — cx 2 \ 2 e x dx. 

a , b, c JO 

State and solve the corresponding maximum problem, as in Exercise 14. 

16 If x 0 e H and M is a closed linear subspace of H, prove that 

min {||x - x 0 ||: x e M} = max {|(x 0 , y)\ : y e M\ ||y|| = 1}. 

17 Show that there is a continuous one-to-one mapping y of [0, 1] into H such that y{b) — y(a) is 
orthogonal to y{d) — y{c) whenever 0 < a <b <, c < d < 1. (y may be called a “curve with orthogonal 
increments.”) Hint : Take H = L 2 , and consider characteristic functions of certain subsets of [0, 1]. 

18 Define u s (t) — e lst for all s e R 1 , t e R 1 . Let X be the complex vector space consisting of all finite 
linear combinations of these functions u s . Iff e X and g e X, show that 

(/, g) = Jim L J * 

exists. Show that this inner product makes X into a unitary space whose completion is a non- 
separable Hilbert space H. Show also that {u s : s e R 1 } is a maximal orthonormal set in H. 

19 Fix a positive integer N, put co = e 2ni,N , prove the orthogonality relations 

if * = ° 

N n =i [0 if 1 < k < N - 1 
and use them to derive the identities 

( x ,y) = -r: E II* + (tfyfef 

™ n= 1 

that hold in every inner product space if N ;> 3. Show also that 

{x,y)= h J* + de - 



CHAPTER 

FIVE 

EXAMPLES OF BANACH SPACE TECHNIQUES 


Banach Spaces 

5.1 In the preceding chapter we saw how certain analytic facts about trigonomet- 
ric series can be made to emerge from essentially goemetric considerations about 
general Hilbert spaces, involving the notions of convexity, subspaces, orthog- 
onality, and completeness. There are many problems in analysis that can be 
attacked with greater ease when they are placed within a suitably chosen abstract 
framework. The theory of Hilbert spaces is not always suitable since orthogonality 
is something rather special. The class of all Banach spaces affords greater variety. 
In this chapter we shall develop some of the basic properties of Banach spaces 
and illustrate them by applications to concrete problems. 

5.2 Definition A complex vector space X is said to be a normed linear space if 
to each x e X there is associated a nonnegative real number ||x||, called the 
norm of x, such that 

(a) || x + y\\ < || at || + ||y|| for all x and y e X, 

(i b ) || ax || = | a | ||x|| if x e X and a is a scalar, 

(c) ||x|| =0 implies x = 0. 

By (a), the triangle inequality 

II X - .HI < II X - z|| + || z - y || (x, y,z e X) 

holds. Combined with ( b ) (take a = 0, a = — 1) and (c) this shows that every 
normed linear space may be regarded as a metric space, the distance between 
x and y being ||x — y ||. 

A Banach space is a normed linear space which is complete in the metric 
defined by its norm. 
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For instance, every Hilbert space is a Banach space, so is every B(p) 
normed by \\f\\ p (provided we identify functions which are equal a.e.) if 
1 < p < oo, and so is C 0 (X) with the supremum norm. The simplest Banach 
space is of course the complex field itself, normed by ||x|| = | x | . 

One can equally well discuss real Banach spaces; the definition is exactly 
the same, except that all scalars are assumed to be real. 

5.3 Definition Consider a linear transformation A from a normed linear 
space X into a normed linear space 7, and define its norm by 

|| A|| = sup .{|| Ax || : x e X, ||x|| < 1}. (1) 

If || A || < oo, then A is called a bounded linear transformation. 

In (1), ||x|| is the norm of x in X 9 ||Ax|| is the norm of Ax in 7 ; it will 
frequently happen that several norms occur together, and the context will 
make it clear which is which. 

Observe that we could restrict ourselves to unit vectors x in (1), i.e., to x 
with ||x|| = 1, without changing the supremum, since 

II A(ax)|| = ||aAx|| = | a 1 1| Ax||. (2) 

Observe also that ||A|| is the smallest number such that the inequality 

II Ax || < || A|| ||x|| (3) 


holds for every x e X. 

The following geometric picture is helpful : A maps the closed unit ball in 
X , i.e., the set 

{x e X: ||x|| < 1}, (4) 

into the closed ball in Y with center at 0 and radius || A||. 

An important special case is obtained by taking the complex field for Y ; 
in that case we talk about bounded linear functionals. 

5.4 Theorem For a linear transformation A of a normed linear space X into a 
normed linear space 7, each of the following three conditions implies the other 
two: 

(a) A is bounded. 

(b) A is continuous. 

(c) A is continuous at one point of X. 

Proof Since || A(x x — x 2 )|| < ||A|| ||x x — x 2 IU it is clear that (a) implies (b), 
and (b) implies (c) trivially. Suppose A is continuous at x 0 . To each e > 0 one 
can then find a S > 0 so that ||x — x 0 || <3 implies || Ax — Ax 0 || < e. In other 
words, ||x|| <3 implies 


II A(x 0 + x) - Ax 0 || < e. 
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But then the linearity of A shows that ||Ax|| < e. Hence ||A|| < e/d, and (c) 
implies (a). //// 

Consequences of Baire’s Theorem 

5.5 The manner in which the completeness of Banach spaces is frequently 
exploited depends on the following theorem about complete metric spaces, which 
also has many applications in other parts of mathematics. It implies two of the 
three most important theorems which make Banach spaces useful tools in 
analysis, the Banach-Steinhaus theorem and the open mapping theorem. The third 
is the Hahn-Banach extension theorem , in which completeness plays no role. 

5.6 Baire’s Theorem If X is a complete metric space , the intersection of every 
countable collection of dense open subsets of X is dense in X. 

In particular (except in the trivial case X = 0), the intersection is not empty. 
This is often the principal significance of the theorem. 

Proof Suppose V l9 V 2 , V 3 , . . . are dense and open in X. Let W be any open 
set in X. We have to show that V n has a point in W if W # 0. 

Let p be the metric of X ; let us write 

S(x, r) = {y e X : p(x, y) < r} (1) 

and let S(x, r) be the closure of S(x, r). INote : There exist situations in which 
S(x, r) does not contain all y with p(x, y) < r!] 

Since V t is dense, W n V t is a nonempty open set, and we can therefore 
find x i and such that 

S(x i, r Y ) a W n V t and 0 < r t < 1. (2) 

If n > 2 and x n _ i and r n _ l are chosen, the denseness of V n shows that V„ n 
S(x„_ !, r„_ i) is not empty, and we can therefore find x„ and r„ such that 

S(x n , r n ) cF n n S(x n _ l9 r n _ J and 0<r n <K (3) 

By induction, this process produces a sequence {x„} in X. If i > n and 
j > n, the construction shows that x t and Xj both lie in S(x n , r„), so that 
p(Xj, Xj) < 2 r n < 2/n, and hence {x n } is a Cauchy sequence. Since X is com- 
plete, there is a point x e X such that x = lim x„ . 

n-* oo 

Since x t lies in the closed set S(x n , r n ) if i > n , it follows that x lies in each 
S(x„ , r„), and (3) shows that x lies in each V n . By (2), x e W. This completes 
the proof. //// 

Corollary In a complete metric space , the intersection of any countable collec- 
tion of dense G d ’s is again a dense G s . 
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This follows from the theorem, since every G d is the intersection of a count- 
able collection of open sets, and since the union of countably many countable 
sets is countable. 


5.7 Baire’s theorem is sometimes called the category theorem , for the following 
reason. 

Call a set E c= X nowhere dense if its closure E contains no nonempty open 
subset of X. Any countable union of nowhere dense sets is called a set of the first 
category ; all other subsets of X are of the second category (Baire’s terminology). 
Theorem 5.6 is equivalent to the statement that no complete metric space is of the 
first category. To see this, just take complements in the statement of Theorem 5.6. 

5.8 The Banach-Steinhaus Theorem Suppose X is a Banach space , Y is a 
normed linear space , and {A a } is a collection of bounded linear transformations 
of X into 7, where a ranges over some index set A. Then either there exists an 
M < oo such that 


II A J < M 


for every oc e A, or 


sup ||A a x|| = oo 

a e A 


(1) 

(2) 


for all x belonging to some dense G d in X. 

In geometric terminology, the alternatives are as follows: Either there is a 
ball B in Y (with radius M and center at 0) such that every A a maps the unit ball 
of X into £, or there exist x e X (in fact, a whole dense G d of them) such that no 
ball in Y contains A a x for all a. 

The theorem is sometimes referred to as the uniform boundedness principle. 

Proof Put 


(p(x) = sup II A a X II (x 6 X) (3) 

a e A 


and let 


V„ = {x : <p(x) >n} (n = 1, 2, 3, . . .)• (4) 

Since each A a is continuous and since the norm of Y is a continuous function 
on Y (an immediate consequence of the triangle inequality, as in the proof of 
Theorem 4.6), each function x— ► ||A a x|| is continuous on X. Hence cp is lower 
semicontinuous, and each V n is open. 
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If one of these sets, say V N , fails to be dense in X , then there exist an 
x 0 e X and an r > 0 such that ||x|| < r implies x 0 + x £ V N ; this means that 
<p(x 0 + x) < N, or 


l|A«(x 0 + x)\\<N (5) 

for all a e A and all x with ||x|l < r. Since x = (x 0 + x) — x 0 , we then have 

II A a x|| < II A a (x 0 + x)|| + ||A.Xoll < 2 N, (6) 

and it follows that (1) holds with M = 2 N/r. 

The other possibility is that every V n is dense in X. In that case, P| V„ is a 
dense G s in X , by Baire’s theorem; and since <p(x) = 00 for every xe f) V n , 

the proof is complete. //// 

5.9 The Open Mapping Theorem Let U and V be the open unit balls of the 
Banach spaces X and Y. To every bounded linear transformation A of X onto 
Y there corresponds a 5 > 0 so that 

A (U)^SV. (1) 

Note the word “ onto ” in the hypothesis. The symbol <5 V stands for the set 

y: y e V} 9 i.e., the set of all y e Y with ||y|| < <5. 

It follows from (1) and the linearity of A that the image of every open ball in 
, with center at x 0 , say, contains an open ball in Y with center at Ax 0 . Hence 
e image of every open set is open. This explains the name of the theorem. 

Here is another way of stating (1): To every y with ||y || < S there corresponds 
x with || x || < 1 so that Ax = y. 

Proof Given y e Y, there exists an x e X such that Ax = y; if ||x|| < k, it 
follows that y e A(kU). Hence Y is the union of the sets A (kU\ for 
k= 1, 2, 3, ... . Since Y is complete, the Baire theorem implies that there is a 
nonempty open set W in the closure of some A (kU). This means that every 
point of W is the limit of a sequence {AxJ, where x ( - e kU ; from now on, k 
and W are fixed. 

Choose y 0 e W, and choose rj > 0 so that y 0 4 - y e W if ||y|| < rj. For 
any such y there are sequences {xj}, {x-'} in kU such that 

Ax; -> y 0 , Ax- -> y 0 + y (i -> oc). (2) 

Setting x f = x" — xj, we have ||x £ || < 2k and Ax t — >y. Since this holds for 
every y with ||y|| < rj , the linearity of A shows that the following is true, if 
d = rj/2k: 

T o each y e Y and to each € > 0 there corresponds an x e X such that 

||x|| < <5 _1 ll}i and \\y - Ax|| < e. (3) 

This is almost the desired conclusion, as stated just before the start of the 
proof, except that there we had 6 = 0. 
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Fix y e SV, and fix e > 0. By (3) there exists an x t with \\x x || < 1 and 

Hy-AxJ <{Se. (4) 

Suppose x l9 . . . , x„ are chosen so that 

<2-36. (5) 

Use (3), with y replaced by the vector on the left side of (5), to obtain anx„ +1 
so that (5) holds with n -F 1 in place of n, and 

IK + ill <2-"£ (n= 1,2,3,...). (6) 

If we set s n = Xi + • • • + x„, (6) shows that {s„} is a Cauchy sequence in 
X. Since X is complete, there exists an x e X so that s„— ► x. The inequality 
llxjl < 1, together with (6), shows that ||x|| < 1 + e. Since A is continuous, 
A s n — ► Ax. By (5), As„— ► y. Hence Ax = y. 

We have now proved that 

A((l 4- €)U) 3 SV, (7) 

or 

A((/)D(l+e)-MF, (8) 

for every e > 0. The union of the sets on the right of (8), taken over all e > 0, 
is SV. This proves (1). //// 

5.10 Theorem If X and Y are Banach spaces and if A is a bounded linear 
transformation of X onto Y which is also one-to-one , then there is a S > 0 such 
that 


II Ax || > <5 1| x || (xeX). (1) 

In other words , A -1 is a bounded linear transformation of Y onto X. 

Proof If S is chosen as in the statement of Theorem 5.9, the conclusion of 
that theorem, combined with the fact that A is now one-to-one, shows that 
||Ax|| < S implies ||x|| < 1. Hence ||x|| > 1 implies ||Ax|| > S , and (1) is 
proved. 

The transformation A -1 is defined on Y by the requirement that 
A' 1 }; = x if y = Ax. A trivial verification shows that A -1 is linear, and (1) 
implies that || A _ 1 1| < 1/S. //// 

Fourier Series of Continuous Functions 

5.11 A Convergence Problem Is it true for every f e C(T) that the Fourier series of 
f converges to f(x) at every point x? 
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Let us recall that the nth partial sum of the Fourier series of / at the point x 
is given by 


S„(A X) = j* f(t)D n (x — t) dt (n = 0, 1, 2, . . .), (1) 

where D n (t) = £ e ikt . (2) 

k= -n 

This follows directly from formulas 4.26(1) and 4.26(3). 

The problem is to determine whether 

lim s„(/; x) =/(x) (3) 

n-* oo 

for every / e C(T) and for every real x. We observed in Sec. 4.26 that the partial 
sums do converge to / in the L 2 -norm, and Theorem 3.12 implies therefore that 
each / e L 2 (T) [hence also each / e C(T)] is the pointwise limit a.e. of some sub- 
sequence of the full sequence of the partial sums. But this does not answer the 
present question. 

We shall see that the Banach-Steinhaus theorem answers the question nega- 
tively. Put 


«*(/; x) = sup |s„(/; x)|. (4) 

n 

To begin with, take x = 0, and define 

A ./= sJJ\ 0) (fe C(T); n = 1,2,3 ,.. .). (5) 

We know that C(T) is a Banach space, relative to the supremum norm WfW^. It 
follows from (1) that each A„ is a bounded linear functional on C(T), of norm 

11 AJ -h\ n lDM dt= llDnh - (6) 

We claim that 


l|AJ|-*oo as oo. (7) 

This will be proved by showing that equality holds in (6) and that 


ll-I^nll i — * 00 as n— ► oo. (8) 

Multiply (2) by e it/2 and by e~ it/2 and subtract one of the resulting two equa- 
tions from the other, to obtain 


DM = 


sin (n + $)t 
sin (t/2) 


(9) 
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Since | sin x \ < \ x | for all real x , (9) shows that 



which proves (8). 

Next, fix n , and put g(t) = 1 if D n (t) > 0, g(t) = — 1 if D n (t) < 0. There exist 
fj e C(T) such that — 1 <fj < 1 and //f)— ► g(t) for every f, as^*— ► oo. By the domi- 
nated convergence theorem, 

lim A n (fj) = lim f f fj(-t)D n (t) dt = f df = ||Z) II || 1 . 

oo j-> ao ^71 J-n 271 J- n 

Thus equality holds in (6), and we have proved (7). 

Since (7) holds, the Banach-Steinhaus theorem asserts now that s*(f ; 0) = oo 
for every /in some dense G d - set in C(T). 

We chose x = 0 just for convenience. It is clear that the same result holds for 
every other x : 

To each real number x there corresponds a set E x c= C(T) which is a dense G d 
in C(T), such that s*(f; x) = oo for every f e E x . 

In particular, the Fourier series of each / e E x diverges at x , and we have a 
negative answer to our question. (Exercise 22 shows the answer is positive if mere 
continuity is replaced by a somewhat stronger smoothness assumption.) 

It is interesting to observe that the above result can be strengthened by 
another application of Baire’s theorem. Let us take countably many points x t , 
and let E be the intersection of the corresponding sets 

E Xi cz C(T). 

By Baire’s theorem, £ is a dense G d in C(T). Every / e E has 

s*(/; x,) = oo 

at every point x { . 

For each / s*(f ; x) is a lower semicontinuous function of x , since (4) exhibits 
it as the supremum of a collection of continuous functions. Hence 
{x: s*(f ; x) = oo} is a G d in R l , for each / If the above points x, are taken so that 
their union is dense in ( — n, n), we obtain the following result: 

5.12 Theorem There is a set E a C{T) which is a dense G d in C(T) and which 
has the following property: For each f e E, the set 

Q f = {x: s*(f; x) = 00 } 


is a dense G s in R l . 
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This gains in interest if we realize that £, as well as each Q fi is an uncount- 
able set : 

5.13 Theorem In a complete metric space X which has no isolated points , no 
countable dense set is a G s . 

Proof Let x k be the points of a countable dense set E in X. Assume that E is 
a G d . Then E = Q V n , where each V n is dense and open. Let 

K=K- U {**}• 

fc= 1 

Then each W n is still a dense open set, but {\W n — 0, in contradiction to 
Baire’s theorem. //// 

Note : A slight change in the proof of Baire’s theorem shows actually that 
every dense G 8 contains a perfect set if X is as above. 

Fourier Coefficients of L 1 -functions 

5.14 As in Sec. 4.26, we associate to every / e l}(T) a function /on Z defined by 

/(n) = ^j* f(t)e~ im dt (neZ). (1) 

It is easy to prove that /(n)— ► 0 as | n | — ► oo, for every f el}. For we know that 
C(T) is dense in l}(T) (Theorem 3.14) and that the trigonometric polynomials are 
dense in C(T) (Theorem 4.25). If e > 0 and / e l}(T ), this says that there is a 
g e C(T) and a trigonometric polynomial P such that \\f—g\\i<e and 
II 9 ~ P\\oo< Since 


||0-P||i<||0-P|| 


if follows that \\f — P\h < 2c; and if | n \ is large enough (depending on P), then 


\f(n)\ = 


2n 


{/(t) - P(t)}e~ iM dt 


<||/-P|| 1 <2e 


(2) 


Thus/(n) — ► 0 as n— ► ±oo. This is known as the Riemann-Lebesgue lemma. 

The question we wish to raise is whether the converse is true. That is to say, 
if {a n } is a sequence of complex numbers such that a„— ► 0 as n— ► ±oo, does it 
follow that there is an f e l}(T) such that f(n) = a n for all n e Z? In other words, 
is something like the Riesz-Fischer theorem true in this situation? 

This can easily be answered (negatively) with the aid of the open mapping 
theorem. 
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Let c 0 be the space of all complex functions (p on Z such that <p(n)— >0 as 
n— ► ± oo, with the supremum norm 

ML = sup {I<P(«)|: n e Z}. (3) 

Then c 0 is easily seen to be a Banach space. In fact, if we declare every subset of 
Z to be open, then Z is a locally compact Hausdorff space, and c 0 is nothing but 
C 0 (Z). 

The following theorem contains the answer to our question : 

5.15 Theorem The mapping f—>f is a one-to-one bounded linear transformation 
of I}(T) into ( but not onto) c 0 . 

Proof Define A by A/ = / It is clear that A is linear. We have just proved 
that A maps l}(T) into c 0 , and formula 5.14(1) shows that \f(n)\ < \\f\\ u so 
that || A|| < 1. (Actually, ||A|| = 1; to see this, take / = 1.) Let us now prove 
that A is one-to-one. Suppose / e Lf(T) and f(n) = 0 for every n e Z. Then 

" f(t)g(t) dt = 0 (1) 

J-n 

if g is any trigonometric polynomial. By Theorem 4.25 and the dominated 
convergence theorem, (1) holds for every g e C(T). Apply the dominated con- 
vergence theorem once more, in conjunction with the Corollary to Lusin’s 
theorem, to conclude that (1) holds if g is the characteristic function of any 
measurable set in T. Now Theorem 1.39(h) shows that /= 0 a.e. 

If the range of A were all of c 0 , Theorem 5.10 would imply the existence 
of a S > 0 such that 


ll/lloo> <511/11 1 (2) 

for every / e L X (T). But if D n (t) is defined as in Sec. 5.11, then D n e L X (T), 
|| DJI qo = 1 for n = 1, 2, 3, ..., and ||D„|| 1 ^ oo as n— ► oo. Hence there is no 
S > 0 such that the inequalities 

llA.IL ><5IIA,lli (3) 


hold for every n. 

This completes the proof. 


mi 


The Hahn-Banach Theorem 

5.16 Theorem If M is a subspace of a normed linear space X and if f is a 
bounded linear functional on M, then f can be extended to a bounded linear 
functional F on X so that ||F|| = ||/||. 


Note that M need not be closed. 
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Before we turn to the proof, some comments seem called for. First, to say (in 
the most general situation) that a function F is an extension of / means that the 
domain of F includes that of / and that F(x) = / (x) for all x in the domain of f 
Second, the norms ||F|| and ||/|| are computed relative to the domains of F and 
/ ; explicitly, 

ll/ll = sup {|/(x)|:xeM, ||x|| < 1}, ||F|| = sup { | F(x)|: x e X, ||x|| < 1}, 

The third comment concerns the field of scalars. So far everything has been 
stated for complex scalars, but the complex field could have been replaced by the 
real field without any changes in statements or proofs. The Hahn-Banach 
theorem is also true in both cases; nevertheless, it appears to be essentially a 
“ real ” theorem. The fact that the complex case was not yet proved when Banach 
wrote his classical book “ Operations lineaires ” may be the main reason that real 
scalars are the only ones considered in his work. 

It will be helpful to introduce some temporary terminology. Recall that V is 
a complex (real) vector space if x + y e V for x and y e V, and if ax e V for all 
complex (real) numbers a. It follows trivially that every complex vector space is 
also a real vector space. A complex function cp on a complex vector space V is a 
complex-linear functional if 

cp(x + y) = (p(x) + <p(y ) and cp( ax) = a cp(x) (1) 

for all x and y e V and all complex a. A real-valued function cp on a complex 
(real) vector space V is a real-linear functional if (1) holds for all real a. 

If u is the real part of a complex-linear functional f i.e., if u(x) is the real part 
of the complex number /(x) for all x e V, it is easily seen that u is a real-linear 
functional. The following relations hold between / and u: 

5.17 Proposition Let V be a complex vector space. 

(a) If u is the real part of a complex-linear functional f on V , then 

f(x) = u(x) — iu(ix) (x e V). (1) 

(b) If u is a real-linear functional on V and if f is defined by (1), then f is a 
complex-linear functional on V. 

(c) If V is a normed linear space and f and u are related as in (1), then 

ll/ll = Ml- 

Proof If a and f are real numbers and z = a 4- if, the real part of iz is — f. 

This gives the identity 

z = Re z — i Re (iz) (2) 

for all complex numbers z. Since 

Re (if (x)) = Re / (ix) = w(ix), 

(1) follows from (2) with z = /(x). 


( 3 ) 
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Under the hypotheses (b\ it is clear that /(x + y) —f(x) + f(y) and that 
/ (ax) = a/ (x) for all real a. But we also have 

f(ix) — u(ix) — iu( — x) = u(ix) + iu(x) — if(x ), (4) 

which proves that /is complex-linear. 

Since |w(x)| < |/(x)|, we have \\u\\ < ||/||. On the other hand, to every 
x g V there corresponds a complex number a, | a | = 1, so that a/(x) = | /(x) |. 
Then 

|/(x)| =/(ax) = w(ax) < ||w|| • ||ax|| = \\u\\ • ||x||, (5) 

which proves that ||/|| < ||w||. //// 

5.18 Proof of Theorem 5.16 We first assume that X is a real normed linear 
space and, consequently, that / is a real-linear bounded functional on Af . If 
|| /|| = 0, the desired extension is F = 0. Omitting this case, there is no loss of 
generality in assuming that ||/|| = 1. 

Choose x 0 g X, x 0 £ M, and let M l be the vector space spanned by Af 
and x 0 . Then Af x consists of all vectors of the form x + 2x 0 , where x e Af 
and A is a real scalar. If we define / x (x + 2x 0 ) = /(x) + 2a, where a is any 
fixed real number, it is trivial to verify that an extension of / to a linear 
functional on is obtained. The problem is to choose a so that the 
extended functional still has norm 1. This will be the case provided that 

|/(x) + 2a | < || x + 2x 0 || (x e M, 1 real). (1) 

Replace x by —Xx and divide both sides of (1) by |A|. The requirement is 
then that 


| /(x) - a | < ||x - x 0 1| (x e Af), (2) 

i.e., that A x < a < B x for all x g Af, where 

A x =f(x) - ||x - x 0 || and B x =/(x) + ||x - x 0 ||. (3) 

There exists such an a if and only if all the intervals \_A X , B x ~] have a common 
point, i.e., if and only if 

A x < B y (4) 

for all x and y g Af. But 

fix) —fiy) —f{x -y)< \\x - y|| < ||x - x 0 || + ||y - x 0 II, (5) 
and so (4) follows from (3). 


We have now proved that there exists a norm-preserving extension of / 

on M v 

Let & be the collection of all ordered pairs (Af', /'), where Af' is a sub- 
space of X which contains Af and where /' is a real-linear extension of / to 
Af', with || /'|| = 1. Partially order ^ by declaring (Af',/') < (Af",/") to mean 
that Af' Af " and /"(x) = /'(x) for all x g Af'. The axioms of a partial order 
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are clearly satisfied, & is not empty since it contains (M,/), and so the Haus- 
dorff maximality theorem asserts the existence of a maximal totally ordered 
subcollection Q of 

Let O be the collection of all M' such that (M',/') e Q. Then O is totally 
ordered, by set inclusion, and therefore the union M of all members of O is a 
subspace of X. (Note that in general the union of two subspaces is not a 
subspace. An example is two planes through the origin in R 3 .) If x e M, then 
x e M' for some M e <D; define F(x) = /'(*), where /' is the function which 
occurs in the pair (M',/') e Q. Our definition of the partial order in Q shows 
that it is immaterial which M' e O we choose to define F(x), as long as AF 
contains x. 

It is now easy to check that F is a linear functional on M, with ||F|| = 1. 
If M were a proper subspace X , the first part of the proof would give us a 
further extension of F, and this would contradict the maximality of Q. Thus 
M = X, and the proof is complete for the case of real scalars. 

If now / is a complex-linear functional on the subspace M of the complex 
normed linear space X , let u be the real part of f use the real Hahn-Banach 
theorem to extend u to a real-linear functional U on X 9 with \\U\\ = ||u||, and 
define 


F(x) = U(x) - iU(ix) (x e X ). (6) 

By Proposition 5.17, F is a complex-linear extension of /, and 

11*1 = \W\\ = ||u|| = ||/||. 

This completes the proof. //// 

Let us mention two important consequences of the Hahn-Banach theorem: 

5.19 Theorem Let M be a linear subspace of a normed linear space X , and let 
x 0 e X . Then x 0 is in the closure M of M if and only if there is no bounded 
linear functional f on X such that /(x) = 0 for all x e M but /(x 0 ) ^ 0. 

Proof If x 0 e M, / is a bounded linear functional on X , and /(x) = 0 for all 
x e M, the continuity of / shows that we also have /(x 0 ) = 0. 

Conversely, suppose x 0 $ M. Then there exists a S > 0 such that 
II x — x 0 1| >3 for all x e M. Let M' be the subspace generated by M and x 0 , 
and define /(x + Ax 0 ) = Aif x e M and X is a scalar. Since 

5|A| ^lAMIxo + A^xll = \\Xx 0 + x||, 

we see that / is a linear functional on M' whose norm is at most <5 _1 . Also 
/(x) = 0 on M,/(x 0 ) = 1. The Hahn-Banach theorem allows us to extend this 
/ from Af ' to X. //// 


5.20 Theorem If X is a normed linear space and if x 0 e X, x 0 ^ 0, there is a 
bounded linear functional f on X , of norm 1, so that f(x 0 ) = ||x 0 ||. 
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Proof Let M = {/bc 0 }, and define /(>bc 0 ) = A||x 0 ||. Then / is a linear function- 
al of norm 1 on M, and the Hahn-Banach theorem can again be applied. //// 

5.21 Remarks If X is a normed linear space, let X * be the collection of all 
bounded linear functionals on X. If addition and scalar multiplication of 
linear functionals are defined in the obvious manner, it is easy to see that X* 
is again a normed linear space. In fact, X * is a Banach space; this follows 
from the fact that the field of scalars is a complete metric space. We leave the 
verification of these properties of X * as an exercise. 

One of the consequences of Theorem 5.20 is that X * is not the trivial 
vector space (i.e., X * consists of more than 0 ) if X is not trivial. In fact, X* 
separates points on X. This means that if x x ^ x 2 in X there exists an /el* 
such that /(x x ) ^ /(x 2 )- To prove this, merely take x 0 = x 2 — x t in Theorem 
5 . 20 . 

Another consequence is that, for xel, 

Ml = sup {|/(x)|:/e X*, ||/|| = 1}. 

Hence, for fixed xel, the mapping /—►/(*) is a bounded linear functional 
on X*, of norm ||x||. 

This interplay between X and X* (the so-called “dual space” of X) 
forms the basis of a large portion of that part of mathematics which is known 
as functional analysis. 

An Abstract Approach to the Poisson Integral 

5.22 Successful applications of the Hahn-Banach theorem to concrete problems 
depend of course on a knowledge of the bounded linear functionals on the 
normed linear space under consideration. So far we have only determined the 
bounded linear functionals on a Hilbert space (where a much simpler proof of 
the Hahn-Banach theorem exists; see Exercise 6), and we know the positive 
linear functionals on C c (X). 

We shall now describe a general situation in which the last-mentioned func- 
tionals occur naturally. 

Let K be a compact Hausdorff space, let H be a compact subset of K , and let 
A be a subspace of C(K) such that 1 e A (1 denotes the function which assigns 
the number 1 to each x e K) and such that 

11/11* =11/11* (/e4 (1) 

Here we used the notation 

\\f\\ E = sup {|/(x)|: x 6 £}. (2) 

Because of the example discussed in Sec. 5.23, H is sometimes called a bound- 
ary of K , corresponding to the space A. 
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If/ e A and x e K, (1) says that 

l/WI< II/IIh. (3) 

In particular, if f(y) = 0 for every y e H, then f(x) = 0 for all x e K. Hence if / x 
and f 2 e A and //y) = / 2 (y) for every y e H, then / x =/ 2 ; to see this, put / = 

/i — /2 • 

Let M be the set of all functions on H that are restrictions to H of members 
of A. It is clear that M is a subspace of C(H). The preceding remark shows that 
each member of M has a unique extension to a member of A. Thus we have a 
natural one-to-one correspondence between M and A, which is also norm- 
preserving, by (1). Hence it will cause no confusion if we use the same letter to 
designate a member of A and its restriction to H. 

Fix a point x e K. The inequality (3) shows that the mapping /—»/(x) is a 
bounded linear functional on M, of norm 1 [since equality holds in (3) if / = 1]. 
By the Hahn-Banach theorem there is a linear functional A on C(H ), of norm 1, 
such that 


< 

II 

2 

(fe M). 

(4) 

We claim that the properties 

A1 = 1, 

II A | = 1 

(5) 


imply that A is a positive linear functional on C(H). 

To prove this, suppose / e C(H), 0 </< 1, put g = 2/ — 1, and put 
Kg = a + i^ where a and /? are real. Note that — 1 < g < 1, so that 
| g + ir | 2 < 1 + r 2 for every real constant r. Hence (5) implies that 

(P + r) 2 < |<x + <05 + r)| 2 = | A(g + ir)| 2 < 1 + r 2 . (6) 

Thus p 2 + 2rj5 < 1 for every real r, which forces P = 0. Since ||gr|| H < 1, we have 
| a | < 1 ; hence 

A f= |A(1 + = Kl + a) > 0. (7) 

Now Theorem 2.14 can be applied. It shows that there is a regular positive 
Borel measure p x on H such that 

A/= (/e C(H)). (8) 

In particular, we get the representation formula 

/(x) = ^/^ (f e A). (9) 

What we have proved is that to each x e K there corresponds a positive 
measure p x on the “boundary ” H which “represents ’’ x in the sense that (9) holds 
for every f e A. 
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Note that A determines n x uniquely; but there is no reason to expect the 
Hahn-Banach extension to be unique. Hence, in general, we cannot say much 
about the uniqueness of the representing measures. Under special circumstances 
we do get uniqueness, as we shall see presently. 

5.23 To see an example of the preceding situation, let U = {z: |z| < 1} be the 
open unit disc in the complex plane, put K = U (the closed unit disc), and take 
for H the boundary T of U. We claim that every polynomial /, i.e., every function 
of the form 


f(z)= £ a n z ", 

n = 0 

where a 0 , . . . , a N are complex numbers, satisfies the relation 

ll/ll i/ = ll/llr- 


(1) 

( 2 ) 


(Note that the continuity of / shows that the supremum of | / 1 over U is the same 
as that over U.) 

Since U is compact, there exists a z 0 e U such that |/(z 0 )| > \f(z)\ for all 
z e U. Assume z 0 e U. Then 


/(z)= X h n( Z ~ Z o)"> 
and if 0 < r < 1 — | z 0 1 , we obtain 


N 

I 

n = 0 


(3) 


I \K 


2 r 2n 


if 

J-, 


| f(z 0 + re ie ) | 2 d0< 


2 71 


|/(z 0 )| 2 dd = \b 0 \ 


so that = b 2 = • • • = b N = 0; i.e., /is constant. Thus z 0 e T for every noncon- 
stant polynomial /, and this proves (2). 

(We have just proved a special case of the maximum modulus theorem; we 
shall see later that this is an important property of all holomorphic functions.) 


5.24 The Poisson Integral Let A be any subspace of C(U) (where U is the closed 
unit disc, as above) such that A contains all polynomials and such that 

ll/ll i/ = ll/llr (1) 

holds for every / e A. We do not exclude the possibility that A consists of 
precisely the polynomials, but A might be larger. 

The general result obtained in Sec. 5.22 applies to A and shows that to each 
z e JJ there corresponds a positive Borel measure \i z on T such that 

/(z)=jV^ 2 (fe A). (2) 

(This also holds for zgT, but is then trivial : \i z is simply the unit mass concen- 
trated at the point z.) 
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We now fix z e U and write z = re ld , 0 < r < 1, 6 real. 

If w„(w) = w", then u n e A for n = 0, 1, 2, . . . ; hence (2) shows that 

rV" e = I u n dp 2 (n = 0, 1, 2, . . .)• (3) 

Jt 

Since w_„ = u n on T, (3) leads to 

U dn z = r |n| e‘" 9 (n = 0, ± 1, ± 2, . . .). (4) 

This suggests that we look at the real function 


P r (9 - t) = X ( f real), (5) 

n— — oo 


since 


27T 


r, 


P r (^ - t)e int dt = r |w| e in0 


(n = 0, ±1, ±2, ...). 


(6) 


Note that the series (5) is dominated by the convergent geometric series £ r |w| , so 
that it is legitimate to insert the series into the integral (6) and to integrate term 
by term, which gives (6). Comparison of (4) and (6) gives 


/^,-r- f f(e u )P r (8 - t) dt 
271 J-n 


(7) 


for / = u n , hence for every trigonometric polynomial /, and Theorem 4.25 now 
implies that (7) holds for every / e C(T). [This shows that \i z was uniquely deter- 
mined by (2). Why?] 

In particular, (7) holds iff e A, and then (2) gives the representation 

m = f * w - *) dt (/ e *)■ (8) 


The series (5) can be summed explicitly, since it is the real part of 


OO 


1 + 2 X ( ze-‘r = 

1 


e lt + z 
e lt — z 


1 — r 2 + 2 ir sin (0 — t) 
|1 -ze- lt l 2 


Thus 


P r (0-t) = 


1 -r 2 

1 — 2r cos (0 — t) + r 2 


( 9 ) 


This is the so-called “ Poisson kernel.” Note that P r (0 — t)>0if0<r< 1. 
We now summarize what we have proved : 
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5.25 Theorem Suppose A is a vector space of continuous complex functions on 
the closed unit disc U. If A contains all polynomials , and if 

sup|/(z)| = sup I /(z) I (1) 

zeU ze T 


for every f e A ( where T is the unit circle , the boundary of U\ then the Poisson 
integral representation 


'W-S 


1 -r ! 


1—2 r cos (9 — t) + r 


f(e h ) dt (z = re ie ) 


(2) 


is valid for every f e A and every z e U. 


Exercises 

1 Let X consist of two points a and b , put MW) = MW) = h and let L p (/u) be the resulting real 
Zf-space. Identify each real function / on X with the point (/(a), f{b)) in the plane, and sketch the unit 
balls of for 0 < p < oo. Note that they are convex if and only if 1 < p < oo. For which p is this 
unit ball a square? A circle? If MW) # p(b), how does the situation differ from the preceding one? 

2 Prove that the unit ball (open or closed) is convex in every normed linear space. 

3 If 1 < p < oo, prove that the unit ball of L%u) is strictly convex ; this means that if 

\\f\\ P =\\g\\ P = l /# g, h = &f+g\ 

then \\h\\ p < 1. (Geometrically, the surface of the ball contains no straight lines.) Show that this fails in 
every l}(p), in every L 00 ^), and in every C(X). (Ignore trivialities, such as spaces consisting of only one 
point.) 

4 Let C be the space of all continuous functions on [0, 1], with the supremum norm. Let M consist of 
all f e C for which 


C 1/2 r i 

At) dt - At) dt = 1 . 

Jo J 1/2 

Prove that M is a closed convex subset of C which contains no element of minimal norm. 

5 Let M be the set of all / e ^([O, 1]), relative to Lebesgue measure, such that 

£/W dt = 1. 

Show that M is a closed convex subset of ^([O, 1]) which contains infinitely many elements of 
minimal norm. (Compare this and Exercise 4 with Theorem 4.10.) 

6 Let / be a bounded linear functional on a subspace M of a Hilbert space H. Prove that / has a 
unique norm-preserving extension to a bounded linear functional on H, and that this extension van- 
ishes on M 1 . 

7 Construct a bounded linear functional on some subspace of some L}{p) which has two (hence 
infinitely many) distinct norm-preserving linear extensions to l}(p). 

8 Let X be a normed linear space, and let X* be its dual space, as defined in Sec. 5.21, with the norm 

||/||= sup {|/(x)|:||x|| < 1}. 

( a ) Prove that X* is a Banach space. 

{b) Prove that the mapping /— ► f(x) is, for each x e X, a bounded linear functional on X *, of 
norm |jx||. (This gives a natural imbedding of X in its “second dual ” X**, the dual space of X*.) 
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(c) Prove that {||xj} is bounded if {x„} is a sequence in X such that {/(*„)} is bounded for 
every f e X*. 

9 Let c 0 , d l , and /°° be the Banach spaces consisting of all complex sequences x = {<!;,•}, 
i = 1, 2, 3, . . . , defined as follows: 

x e Z 1 if and only if ||x|| j = £ I £,• I < oo. 

x e /°° if and only if ||x|| „ = sup | f ,■ \ < oo. 

c 0 is the subspace of /°° consisting of all x e /°° for which £,■— ► Oas i— ► oo. 

Prove the following four statements. 

(a) If y = {>/,} e Z 1 and Ax = £ rj t for every xec 0 , then A is a bounded linear functional on 
c 0 , and || A|| = ||y|| v Moreover, every A g (c 0 )* is obtained in this way. In brief, (c 0 )* = d l . 

(More precisely, these two spaces are not equal; the preceding statement exhibits an isometric 
vector space isomorphism between them.) 

(b) In the same sense, (Z 1 )* = /°°. 

(c) Every ye/ 1 induces a bounded linear functional on /°°, as in (a). However, this does not 
give all of (/°°)*, since (/°°)* contains nontrivial functionals that vanish on all of c 0 . 

{d) c 0 and Z 1 are separable but /°° is not. 

10 If £ a, converges for every sequence (fj such that — ► 0 as i— > oo, prove that ^ | a, | < oo. 

11 For 0 < a < 1, let Lip a denote the space of all complex functions / on [ a , h] for which 


M f = sup 

s*t 


I/W-/MI 


< oo. 


Prove that Lip a is a Banach space, if ||/|| = \f(a)\ + M f \ also, if 

ll/ll = M f + sup |/(x)|. 

X 


(The members of Lip a are said to satisfy a Lipschitz condition of order a.) 

12 Let K be a triangle (two-dimensional figure) in the plane, let H be the set consisting of the vertices 
of K, and let A be the set of all real functions / on K, of the form 


/ (x, y) = ax + fly 4- y (a, /?, and y real). 


Show that to each (x 0 , y 0 ) e K there corresponds a unique measure p on H such that 


/(* 0. )'o) = 


A/ 1 *- 


(Compare Sec. 5.22.) 

Replace K by a square, let H again be the set of its vertices, and let A be as above. Show that to 
each point of K there still corresponds a measure on H, with the above property, but that uniqueness 
is now lost. 

Can you extrapolate to a more general theorem? (Think of other figures, higher dimensional 
spaces.) 

13 Let {/„} be a sequence of continuous complex functions on a (nonempty) complete metric space X , 
such that /(x) = lim f n (x ) exists (as a complex number) for every x e X. 

{a) Prove that there is an open set V # 0 and a number M < oo such that | /„(x) | < M for all 
x e V and for n = 1, 2, 3, 

(h) If e > 0, prove that there is an open set V # 0 and an integer N such that | /(x) — f n {x) \ < e 
if x g V and n > N. 

Hint for ( b ): For N = 1, 2, 3, ... , put 


a n = {* : l/*M -/„(*)! < e if m > N and n > N}. 
Since X = (J A N , some A N has a nonempty interior. 
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14 Let C be the space of all real continuous functions on / = [0, 1] with the supremum norm. Let X n 
be the subset of C consisting of those / for which there exists ate/ such that | f(s) — f(t) | ^ n | s — 1 1 
for all s e /. Fix n and prove that each open set in C contains an open set which does not intersect 
X n . (Each / e C can be uniformly approximated by a zigzag function g with very large slopes, and if 
\\g — h\\ is small, h $ X n .) Show that this implies the existence of a dense G s in C which consists 
entirely of nowhere differentiable functions. 

15 Let A = (fly) be an infinite matrix with complex entries, where i, j — 0, 1,2, A associates with 

each sequence {s,} a sequence {c,}, defined by 




oo 


X a ij S j 
j= 0 


0=1, 2, 3,...), 


provided that these series converge. 

Prove that A transforms every convergent sequence {sj} to a sequence {c,} which converges to 
the same limit if and only if the following conditions are satisfied : 

(a) lim a tj = 0 for each j. 

i-*co 

oo 

(b) sup, X Kyi < 00 • 

J=o 

(c) lim £ fly = 1. 
i— oo j= 0 

The process of passing from {sj} to {<r,} is called a summability method. Two examples are: 

777 lros ' st 

0 if i < j, 

and a tj = (1 — r,)rf, 0 < r { < 1, r f — ^ ► 1. 

Prove that each of these also transforms some divergent sequences {sj} (even some unbounded ones) 
to convergent sequences {c,}. 

16 Suppose X and Y are Banach spaces, and suppose A is a linear mapping of X into 7, with the 
following property: For every sequence {x„} in X for which x = lim x n and y = lim Ax n exist, it is 
true that y = Ax. Prove that A is continuous. 

This is the so-called “ closed graph theorem.” Hint : Let X © Y be the set of all ordered pairs 
(x, y), x e X and y e Y, with addition and scalar multiplication defined componentwise. Prove that 
X ® 7 is a Banach space, if ||(x, y)|| = ||x|| + ||y||. The graph G of A is the subset of X © Y formed by 
the pairs (x, Ax), x e X. Note that our hypothesis says that G is closed; hence G is a Banach space. 
Note that (x, Ax)— ► x is continuous, one-to-one, and linear and maps G onto X. 

Observe that there exist nonlinear mappings (of R 1 onto R l , for instance) whose graph is closed 
although they are not continuous: /(x) = 1/x if x # 0,/(0) = 0. 

17 If g. is a positive measure, each / e L°°(/i) defines a multiplication operator M f on l3{fi) into L 2 (/i), 
such that M f {g) = fg. Prove that \\M f \\ < WfW^. For which measures fi is it true that \\M f \\ = WfW^ 
for all / e L°°(/*)? For which / g L°°(/i) does M f map 13(g) onto 13(g)! 

18 Suppose {A„} is a sequence of bounded linear transformations from a normed linear space X to a 
Banach space Y, suppose ||AJ| < M < oo for all n, and suppose there is a dense set E c= X such that 
{A„x} converges for each x e E. Prove that {A„x} converges for each x e X. 
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19 If s n is the nth partial sum of the Fourier series of a function / e C(T), prove that s„/log n— ► 0 
uniformly, as n— ► oo, for each / e C(T). That is, prove that 


lim 

n~* oo 


IKIloo 

log n 


On the other hand, if XJ\o% n— ► 0, prove that there exists an f e C(T) such that the sequence 
{s„(/; 0 )//L„} is unbounded. Hint : Apply the reasoning of Exercise 18 and that of Sec. 5 . 11 , with a 
better estimate of HDJjj than was used there. 

20 (a) Does there exist a sequence of continuous positive functions f n on R l such that {/„(*)} is 
unbounded if and only if x is rational ? 

(b) Replace “ rational ” by “ irrational ” in ( a ) and answer the resulting question. 

(c) Replace “{/„(*)} is unbounded” by “/„(*)— > oo as n— ► oo ” and answer the resulting ana- 
logues of ( a ) and ( b ). 

21 Suppose E c= R l is measurable, and m(E) = 0. Must there be a translate E + x of E that does not 
intersect E? Must there be a homeomorphism h of R 1 onto R 1 so that h(E) does not intersect £? 

22 Suppose f e C(T) and / e Lip a for some a > 0. (See Exercise 11.) Prove that the Fourier series of / 
converges to / (x), by completing the following outline : It is enough to consider the case x = 0, 
/( 0) = 0. The difference between the partial sums s„(/; 0) and the integrals 


n 



sin nt 

dt 

t 


tends to 0 as n— ► oo. The function f{t)/t is in L}(T). Apply the Riemann-Lebesgue lemma. More careful 
reasoning shows that the convergence is actually uniform on T. 
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Total Variation 

6.1 Introduction Let $01 be a a-algebra in a set X. Call a countable collection {E t } 
of members of $01 a partition of E if E t n Ej = 0 whenever i ^ j, and if E — 
(J E t . A complex measure p on $01 is then a complex function on $01 such that 

n(E) = X (E e aw) (1) 

1 = 1 

for every partition of E. 

Observe that the convergence of the series in (1) is now part of the require- 
ment (unlike for positive measures, where the series could either converge or 
diverge to oo). Since the union of the sets E t is not changed if the subscripts are 
permuted, every rearrangement of the series (1) must also converge. Hence ([26], 
Theorem 3.56) the series actually converges absolutely. 

Let us consider the problem of finding a positive measure X which dominates 
a given complex measure p on $01, in the sense that | p(E) \ < X{E) for every E e $01, 
and let us try to keep X as small as we can. Every solution to our problem (if 
there is one at all) must satisfy 

A(E) = £ A(£ ; ) > £ ItffiJI, (2) 

i= 1 1 

for every partition {£j of any set E e $01, so that X(E) is at least equal to the 
supremum of the sums on the right of (2), taken over all partitions of E. This 
suggests that we define a set function | p | on $01 by 

\fi\(E) = sup £ | fi(E,) | (E e (3) 

i = 1 

the supremum being taken over all partitions (Ej of E. 


116 
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This notation is perhaps not the best, but it is the customary one. Note that 
| \i | (E) > | p(E) | , but that in general | \i \ (E) is not equal to | p(E) | . 

It turns out, as will be proved below, that | \i | actually is a measure, so that 
our problem does have a solution. The discussion which led to (3) shows then 
clearly that | \i | is the minimal solution, in the sense that any other solution X has 
the property X(E) >\p\(E) for all E e 

The set function | \i | is called the total variation of p, or sometimes, to avoid 
misunderstanding, the total variation measure. The term “ total variation of p ” is 
also frequently used to denote the number | p | (X). 

If p is a positive measure, then of course | p | = p. 

Besides being a measure, \p\ has another unexpected property: \p\(X) < oo. 
Since | p(E) \ < \ p \ (E) < \ \i \ (X\ this implies that every complex measure \i on 
any a-algebra is bounded: If the range of \i lies in the complex plane, then it 
actually lies in some disc of finite radius. This property (proved in Theorem 6.4) is 
sometimes expressed by saying that \i is of bounded variation. 


6.2 Theorem The total variation \fi\ of a complex measure p on Wl is a positive 
measure on 9W. 

Proof Let {£j be a partition of E e Let t { be real numbers such that 
t t < \p\ ( E t ). Then each E ( has a partition {A tJ } such that 

Z I ^ A ij) I > h (' = 1, 2, 3, . . .)• (1) 

l 

Since {Ay} (ij = 1, 2, 3, . . .) is a partition of E , it follows that 


If,<El^ y )l<l/<l(£). (2) 

i i, j 

Taking the supremum of the left side of (2), over all admissible choices of (tj, 
we see that 


i 


( 3 ) 


To prove the opposite inequality, let {Aj} be any partition of E. Then for 
any fixed j, {Aj n is a partition of A j9 and for any fixed i, {Aj n is a 
partition of E t . Hence 


Z \v( a j)\ 


=z 


Z viA, ^ Ed 


<EEl Ed 

j i 


= H\K A J n £,)| <Zl^l( £ i)- 

i j i 


( 4 ) 
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Since (4) holds for every partition { A } } of E , we have 

l^l(£)<ll^l(£ i ). (5) 

i 

By (3) and (5), | \i | is countably additive. 

Note that the Corollary to Theorem 1.27 was used in (2) and (4). 

That | \i | is not identically oo is a trivial consequence of Theorem 6.4 but 
can also be seen right now, since | \i | (0) = 0. //// 


6.3 Lemma If z l9 . .., z N are complex numbers then there is a subset S of 
{1, ..., N} for which 


Z* 



N 


Z l z *l- 

k= 1 


Proof Write z k = \z k \e i<Xk . For — n < 0 < n, let S(6) be the set of all k for 
which cos (a fc — 0) > 0. Then 





> Re X « w z k = X I h | cos + (a* - d). 

S(6) k= 1 


Choose 0 O so as to maximize the last sum, and put S = S(6 0 ). This 
maximum is at least as large as the average of the sum over [ — n, 7c], and this 
average is n ~ 1 \ z k | , because 

1 C n 1 

— cos + (a — 0) dO = - 
J-n n 


for every a. 


//// 


6.4 Theorem If pis a complex measure on X , then 

\l*\(X)<ao. 


Proof Suppose first that some set E e has \p\(E) = co. Put 
t = 7c(l + | p(E) | ). Since \p\(E)> t, there is a partition {£j of E such that 

Z IM£.)l>t 

i = 1 

for some N. Apply Lemma 6.3, with z t = p{Ei), to conclude that there is a set 
A a E (a. union of some of the sets £*) for which 

\p(A)\>t/n>l. 

Setting B = E — A, it follows that 

\n(B)\ = \m- *a)\ ^i^)i -\m\>- -\m\ = i. 

71 
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We have thus split E into disjoint sets A and B with | p(A) \ > 1 and 
| p(B) | > 1. Evidently, at least one of | p \ (A) and | /x | (JB) is oo, by Theorem 6.2. 

Now if \fi\(X) = oo, split X into A l9 B l9 as above, with \p(A l )\ > 1, 
\n\(Bi) = oo. Split into A 29 B 29 with \p(A 2 )\ > 1, \p\(B 2 ) = oo. Contin- 
uing in this way, we get a countably infinite disjoint collection {Aj, with 
| fi(Ai) | > 1 for each i. The countable additivity of p implies that 

t(\J = I M 

But this series cannot converge, since p(A £ ) does not tend to 0 as i — ► oo. This 
contradiction shows that | p | (X) < °o- //// 

6.5 If p and X are complex measures on the same d-algebra we define \i + X 
and c\i by 


(fi + X)(E) = fi(E) + X(E) 

(EeWl) (1) 

(cvm = cm 

for any scalar c, in the usual manner. It is then trivial to verify that \i + X and c\i 
are complex measures. The collection of all complex measures on is thus a 
vector space. If we put 


M = \v\(n 

it is easy to verify that all axioms of a normed linear space are satisfied. 


(2) 


6.6 Positive and Negative Variations Let us now specialize and consider a real 
measure [i on a <j-algebra (Such measures are frequently called signed mea- 
sures.) Define | \i | as before, and define 

^ + =i<l^l+^)> = i(l/*l — /*)• (1) 

Then both g + and \i~ are positive measures on and they are bounded, by 
Theorem 6.4. Also, 


g = g + - g 9 \g \ = g + + \i . ( 2 ) 

The measures g + and \i~ are called the positive and negative variations of p 9 
respectively. This representation of p as the difference of the positive measures p + 
and p~ is known as the Jordan decomposition of p. Among all representations of 
p as a difference of two positive measures, the Jordan decomposition has a 
certain minimum property which will be established as a corollary to Theorem 
6.14. 
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Absolute Continuity 

6.7 Definitions Let p be a positive measure on a a-algebra 901, and let X be an 
arbitrary measure on 901; X may be positive or complex. (Recall that a 
complex measure has its range in the complex plane, but that our usage of 
the term “positive measure” includes oo as an admissible value. Thus the 
positive measures do not form a subclass of the complex ones.) 

We say that X is absolutely continuous with respect to \i, and write 

X<p (1) 

if X(E) = 0 for every E e 901 for which p(E) = 0. 

If there is a set A e 901 such that X(E) = X (A n E) for every E e 901, we 
say that X is concentrated on A. This is equivalent to the hypothesis that 
X(E) = 0 whenever E n A = 0. 

Suppose X x and X 2 are measures on 901, and suppose there exists a pair of 
disjoint sets A and B such that X x is concentrated on A and X 2 is concen- 
trated on B . Then we say that X x and X 2 are mutually singular , and write 

X x 1 X 2 . (2) 

Here are some elementary properties of these concepts. 

6.8 Proposition Suppose , p, X, X l9 and X 2 are measures on a o-algebra 901, and p 
is positive. 

(a) If X is concentrated on A , so is \X\. 

(b) IfX x 1 X 2 , then \ X x | 1 \X 2 \. 

(c) If X x _L p and X 2 _L p , then X x 4- X 2 _L p. 

(d) If X x p and X 2 p , then X x + X 2 P- 

(e) If X < p , then \ X \ p. 

(/) If X x p and X 2 _L p , then X x _L X 2 . 

(g) If X p and XI p, then X = 0. 

Proof 

(a) If E n A = 0 and {£,} is any partition of E, then X(Ej) = 0 for all j. 
Hence | X \ (E) = 0. 

(b) This follows immediately from (a). 

(c) There are disjoint sets A x and B x such that X x is concentrated on A x and 
p on B l9 and there are disjoint sets A 2 and B 2 such that X 2 is concen- 
trated on A 2 and p on B 2 . Hence X x + X 2 is concentrated on A — A x u 

A 2 , p is concentrated on B = B x n B 2 , and A n B = 0. 

(d) This is obvious. 

(e) Suppose p(E) = 0, and {£,} is a partition of E. Then p(Ej) = 0; and since 
X p, X(Ej ) = 0 for all j, hence £ | X(Ej ) | = 0. This implies | X \ (E) = 0. 



COMPLEX MEASURES 121 


(/) Since X 2 _L p, there is a set A with p(A) = 0 on which X 2 is concentrated. 
Since X x p, XfE) = 0 for every E a A. So X x is concentrated on the 
complement of A. 

(i g ) By (/), the hypothesis of (g) implies, that X _L X, and this clearly forces 

1 = 0 . //// 

We come now to the principal theorem about absolute continuity. In fact, it 
is probably the most important theorem in measure theory. Its statement will 
involve a-finite measures. The following lemma describes one of their significant 
properties. 

6.9 Lemma If pis a positive a-finite measure on a o-algebra 501 in a set X , then 
there is a function xv e l}(p) such that 0 < w(x) < 1 for every x e X. 

Proof To say that p is <r-finite means that X is the union of countably many 
sets E n e yjl (n = 1, 2, 3, . . .) for which p(E n ) is finite. Put w„(x) = 0 if x e 
X — E n and put 


w n (x) = 2~7(1 + p(E n )) 

if x e E n . Then w = £S° w„ has the required properties. //// 

The point of the lemma is that p can be replaced by a finite measure p 
(namely, dp — w dp) which, because of the strict positivity of w, has precisely the 
same sets of measure 0 as p. 

6.10 The Theorem of Lebesgue-Radon-Nikodym Let p be a positive a-finite 
measure on a o-algebra 501 in a set X, and let Xbe a complex measure on 501. 

(a) There is then a unique pair of complex measures X a and X s on 501 such that 

= K + K, K K -L v- (1) 

If X is positive and finite , then so are X a and X s . 

(b) There is a unique h e l}(p) such that 

UE) = jhd/i (2) 

for every set E e 501. 

The pair ( X a , X s ) is called the Lebesgue decomposition of X relative to p. The 
uniqueness of the decomposition is easily seen, for if (X ' a , X' s ) is another pair which 
satisfies (1), then 

K ~ = K ~ K > (3) 

X' a — X a <^ p , and X s — X' s _L p , hence both sides of (3) are 0; we have used 6.8(c), 
6.8(d), and 6.8 (g). 
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The existence of the decomposition is the significant part of (a). 

Assertion (b) is known as the Radon-Nikodym theorem. Again, uniqueness of 
h is immediate, from Theorem 1.39(h). Also, if h is any member of l}{g\ the 
integral in (2) defines a measure on $01 (Theorem 1.29) which is clearly absolutely 
continuous with respect to g. The point of the Radon-Nikodym theorem is the 
converse: Every X \i (in which case X a = X) is obtained in this way. 

The function h which occurs in (2) is called the Radon-Nikodym derivative of 
X a with respect to g. As noted after Theorem 1.29, we may express (2) in the form 
dX a = h dg , , or even in the form h = dXJdg. 

The idea of the following proof, which yields both (a) and ( b ) at one stroke, is 
due to von Neumann. 


Proof Assume first that X is a positive bounded measure on $01. Associate w 
to g as in Lemma 6.9. Then dcp = dX + w dg defines a positive bounded 
measure cp on $01. The definition of the sum of two measures shows that 


\/ d(p = 



(4) 


for / = Xe > hence for simple /, hence for any nonnegative measurable /. If 
/ e L 2 (<p), the Schwarz inequality gives 



< Jj / 1 dX < 


J Jf\dq>£ 



wm 112 - 


Since (p(X) < oo, we see that 


( 5 ) 

is a bounded linear functional on L 2 (<^). We know that every bounded linear 
functional on a Hilbert space H is given by an inner product with an element 
of H. Hence there exists a g e L 2 (<p) such that 

[fdX=\fgdcp ( 6 ) 

Jx Jx 


for every / e L 2 (<p). 

Observe how the completeness of L 2 (^) was used to guarantee the exis- 
tence of g. Observe also that although g is defined uniquely as an element of 
L 2 (^), g is determined only a.e. [ 9 ] as a point function on X. 

P ut / = Xe i n ( 6 ), for any E e $01 with (p(E) > 0. The left side of ( 6 ) is then 
X(E ), and since 0 < X < <p, we have 

<p( £ ) Je <P(E) 


( 7 ) 
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Hence g(x) e [0, 1] for almost all x (with respect to cp ), by Theorem 1.40. We 
may therefore assume that 0 < g(x) < 1 for every x e X 5 without affecting (6), 
and we rewrite (6) in the form 

[(1 - 9)f M = (* fgw dg. ( 8 ) 

Jx Jx 

Put 

A = {x: 0 < g(x) < 1}, B = {x: g(x) = 1}, (9) 

and define measures A a and A s by 

A a (E) = A (A n E\ A S (E) = A(B n £), (10) 

for all E e 90L 

If f — 1 b m (8), the left side is 0, the right side is w dg. Since w(x) > 0 
for all x, we conclude that g(B) — 0. Thus A s _L g. 

Since g is bounded, (8) holds if / is replaced by 

(1 + g + ■ ■ ■ + g n )x E 

for n— 1, 2, 3 e 9W. For such /, (8) becomes 

f (1 -g n + i )ds 1 = 0(1 +0 + ••• +g n )w dg. ( 11 ) 

JE JE 

At every point of 1 5, g(x) = 1, hence 1 — g n + 1 (x) = 0. At every point of A, 
g n + 1 (x)—> 0 monotonically. The left side of (11) converges therefore to 
A(A n E) = A a (E) as n^> oo. 

The integrands on the right side of (11) increase monotonically to a non- 
negative measurable limit h , and the monotone convergence theorem shows 
that the right side of (1 1) tends to J £ h dg as n— ► oo. 

We have thus proved that (2) holds for every E e $01. Taking E = X, we 
see that h e I}(g ), since A a (X) < oo. 

Finally, (2) shows that A a < g, and the proof is complete for positive A. 

If A is a complex measure on then A = A x iA 2 , with A t and A 2 real, 
and we can apply the preceding case to the positive and negative variations 
of A x and A 2 . //// 

If both g and A are positive and a-finite, most of Theorem 6.10 is still true. 
We can now write X = X„, where g(X„) < oo and A(X„) < oo, for n — 1, 2, 3, 

.... The Lebesgue decompositions of the measures A(E n X n ) still give us a 
Lebesgue decomposition of A , and we still get a function h which satisfies 
Eq. 6.10(2); however, it is no longer true that h e I}(g\ although h is “locally in 
L 1 ,” i.e., h dg < oo for each n. 

Finally, if we go beyond a-finiteness, we meet situations where the two theo- 
rems under consideration actually fail. For example, let g be Lebesgue measure 
on (0, 1), and let A be the counting measure on the a-algebra of all Lebesgue 
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measurable sets in (0, 1). Then X has no Lebesgue decomposition relative to p , and 
although p X and p is bounded , there is no h e I}(X) such that dp — h dX. We 
omit the easy proof. 

The following theorem may explain why the word “continuity” is used in 
connection with the relation X p. 

6.11 Theorem Suppose p and X are measures on a a-algebra 501, p is positive , 

and X is complex. Then the following two conditions are equivalent: 

(a) X <$ p. 

(b) T o every e > 0 corresponds a S > 0 such that | X(E) | < e for all E e $Jl 
with p(E) < S. 


Property (b) is sometimes used as the definition of absolute continuity. 
However, (a) does not imply ( b ) if X is a positive unbounded measure. For 
instance, let p be Lebesgue measure on (0, 1), and put 


X(E) = 



dt 


for every Lebesgue measurable set E c= (0, 1). 


Proof Suppose ( b ) holds. If p(E) = 0, then p(E) < d for every S > 0, hence 
| X(E) | < e for every e > 0, so X(E) = 0. Thus ( b ) implies (a). 

Suppose (b) is false. Then there exists an e > 0 and there exist sets E n e 
(n = 1, 2, 3, . . .) such that p(E n ) <2~ n but | X(E n ) \ > e. Hence | X \ (E n ) > e. 
Put 


A n =(jEi, A=f]A n . (1) 

i = n n = 1 

Then p(A n ) < 2~ n + 1 , A n => A n + l , and so Theorem 1.19(e) shows that p(A) = 0 
and that 


| X | (A) = lim | X | (A n ) > e > 0, 

n-> oo 


since | A | (A„) > |A| (£„). 

It follows that we do not have | A | < /t, hence (a) is false, by Proposition 

6.8(c). HH 

Consequences of the Radon-Nikodym Theorem 

6.12 Theorem Let n be a complex measure on a a-algebra 9W in X. Then there 
is a measurable function h such that \ h(x) \ = 1 for all x e X and such that 


dp = h d | p | . 


(1) 
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By analogy with the representation of a complex number as the product of 
ts absolute value and a number of absolute value 1, Eq. (1) is sometimes referred 
o as the polar representation (or polar decomposition) of p. 


Proof It is trivial that p \ p | , and therefore the Radon-Nikodym theorem 
guarantees the existence of some h e l}( \ p \ ) which satisfies (1). 

Let A r = {x: \ h(x) \ < r}, where r is some positive number, and let { Ej } be 
a partition of A r . Then 




hd\p I 

Ej 


<Zr\p\(Ej) = r\p\(A r \ 


so that | p | ( A r ) <r\p\ ( A r ). If r < 1, this forces | p | ( A r ) = 0. Thus | h \ > 1 a.e. 
On the other hand, if | p \ (E) > 0, (1) shows that 


1 

i nm 



d \fi\ 


\m\ . 

\nm~ 


We now apply Theorem 1.40 (with the closed unit disc in place of S) and 
conclude that | h \ < 1 a.e. 

Let B = {x e X: \ h(x) \ ^ 1}. We have shown that | p \ (B) = 0, and if we 
redefine h on B so that h(x) = 1 on B, we obtain a function with the desired 
properties. //// 


6.13 Theorem Suppose pis a positive measure on$Jl,g e l}(p ), and 


Then 


m = 


m (£) = 



(e e m). 


(e e my 


(i) 


( 2 ) 


Proof By Theorem 6.12, there is a function h , of absolute value 1, such that 
dX — h d |2| . By hypothesis, dX = g dp. Hence 

h d \X \ = g dp. 

This gives d \X \ = hg dp. (Compare with Theorem 1.29.) 

Since \X \ > 0 and p > 0, it follows that fig >0 a.e. [//], so that hg = | g \ 

а. e. M. //// 

б. 14 The Hahn Decomposition Theorem Let p be a real measure on a o- 
algebra m in a set X. Then there exist sets A and B em such that 
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A U B = X, A n B = 0, ant/ such that the positive and negative variations 
p + and p~ of p satisfy 

p + (E) = p(A n £), p~(E) = -p(B n E) (Eg 9M). (1) 


In other words, X is the union of two disjoint measurable sets A and B , such 
that “ A carries all the positive mass of p” [since (1) implies that p(E)> 0 if 
£c/4] and “B carries all the negative mass of /*” [since p(E) < 0 if E a B]. The 
pair (>4, B) is called a Hahn decomposition of X , induced by p. 


Proof By Theorem 6.12, = h d \ p \ , where \ h\ — 1. Since p is real, it fol- 

lows that h is real (a.e., and therefore everywhere, by redefining on a set of 
measure 0), hence h = + 1. Put 


/I = {x: h(x) = 1}, B = {x: h(x) = — 1}. 
Since p + =%(\p \ + p) 9 and since 


(2) 


Ki + fc) 


f/i on >4, 
jo on B , 


( 3 ) 


we have, for any E g 9W, 

^(£)-if(l + *)^l-| = (4) 

Z JE JE n A 

Since /x(£) = p(E n ,4) 4- p(E n B) and since // = //"*" — the second half of 
(1) follows from the first. //// 


Corollary If p = > where and positive measures , f/ien > p + 

and X 2 > p~ - 


This is the minimum property of the Jordan decomposition which was men- 
tioned in Sec. 6.6. 

Proof Since p < we have 

p + (E) = p(E n A) < ^(E n A) < ^(E). //// 


Bounded Linear Functionals on IF 

6.15 Let p be a positive measure, suppose 1 < p < oo, and let q be the exponent 
conjugate to p. The Holder inequality (Theorem 3.8) shows that if g g I3(p) and if 
is defined by 

*JJ) = \fy dfi, 


( 1 ) 
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then <S> g is a bounded linear functional on LF(p), of norm at most \\g\\ q . The ques- 
tion naturally arises whether all bounded linear functionals on H(p) have this 
form, and whether the representation is unique. 

For p = oo, Exercise 13 shows that the answer is negative: Lf(m) does not 
furnish all bounded linear functionals on n°(m). For 1 < p < oo, the answer is 
affirmative. It is also affirmative for p — 1, provided certain measure-theoretic 
pathologies are excluded. For a-finite measure spaces, no difficulties arise, and we 
shall confine ourselves to this case. 

6.16 Theorem Suppose 1 < p < oo, pis a a -finite positive measure on X , and O 
is a bounded linear functional on B(p). Then there is a unique g e I3(p ), where q 
is the exponent conjugate to p , such that 

<Hf) = j fo dfi (fe U(n)). (1) 

Moreover , if O and g are related as in (1), we have 

non = iwi,. (2) 

In other words, I3(p) is isometrically isomorphic to the dual space of B(p), 
under the stated conditions. 

Proof The uniqueness of g is clear, for if g and g' satisfy (1), then the integral 
of g — g' over any measurable set E of finite measure is 0 (as we see by taking 
Xe for /), and the a-finiteness of p implies therefore that g — g’ = 0 a.e. 

Next, if (1) holds, Holder’s inequality implies 

non < ii0ii f . (3) 

So it remains to prove that g exists and that equality holds in (3). If ||0|| = 0, 
(1) and (2) hold with g = 0. So assume ||0|| > 0. 

We first consider the case p(X) < oo. 

For any measurable set E a X, define 

m = «*x E ). 

Since Q> is linear, and since Xa u b = Xa + Xb if A an d B are disjoint, we see 

that X is additive. To prove countable additivity, suppose E is the union of 

countably many disjoint measurable sets E i9 put A k = E x u • u E k , and 
note that 

life - XA k lip = WE - A k )y ,p -+ 0 (*- oo); (4) 

the continuity of O shows now that X(A k )— ► X(E). So X is a complex measure. 
[In (4) the assumption p < oo was used.] It is clear that X(E) = 0 if p(E) = 0, 
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since then \\xe\\ p = 0- Thus X g, and the Radon-Nikodym theorem ensures 
the existence of a function g e l}(g) such that, for every measurable E c= X, 

= \9 dn = \Xe 9 dfi. (5) 

JE JX 

By linearity it follows that 

= j fg dn (6) 

holds for every simple measurable /, and so also for every / e E°(g\ since 
every / e E°(g) is a uniform limit of simple functions f t . Note that the uniform 
convergence of f t to/implies \\f t -f\\ p -> 0, hence d >(/ )-► <D (/), as i—> oo. 

We want to conclude that g e 13(g) and that (2) holds ; it is best to split 
the argument into two cases. 


Case 1 p = 1. Here (5) shows that 



<ii<*>ii • \\xe\\ i = ii<dii -m 


for every E e 901. By Theorem 1.40, \g(x) \ < ||0|| a.e., so that WgW^ < ||0||. 


Case 2 1 < p < oo. There is a measurable function a, |a| = 1, such that 
ag = \g\ [Proposition 1.9(e)]. Let E n = {x: \g(x)\<n}, and define / = 
Xe„ l0l" -1 a' Then \f\ p = \g\ q on E n ,fe and (6) gives 

f \g\ 9 dn= [fgdn = <t>{f)< ||4>||{f \g\< 

jEn JX (.t/En 

so that 

^XE n \g\‘ , dfi<m q (»= 1,2,3,...). (7) 

If we apply the monotone convergence theorem to (7), we obtain \\g\\ q < ||0||. 

Thus (2) holds and g e 13(g). It follows that both sides of (6) are contin- 
uous functions on 13(g). They coincide on the dense subset 13° (g) of 13(g); 
hence they coincide on all of I3(g\ and this completes the proof if g(X) < oo. 

If g(X) = oo but g is a-finite, choose w e l}(g) as in Lemma 6.9. Then 
dg = w dg defines a finite measure on 901, and 

F— ► w l/p F (8) 

is a linear isometry of 13(g) onto 13(g) , because w(x) > 0 for every x e X. 
Hence 



¥(F) = ®(w 1/p F) 

defines a bounded linear functional ¥ on 13(g ) , with H'FH = ||0||. 


( 9 ) 
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The first part of the proof shows now that there exists G e 13(g) such that 
'V(F) = j* FG dfi (F e (10) 

Put g = w 1/q G. (If p = 1, g = G.) Then 

fl g\ q dg= [\G\ q dg = |W= \m\ q (11) 

JX JX 

if p > 1, whereas | \g\\„ = ||G|L = \m = ||0|| if p = 1. Thus (2) holds, and 
since G dg = w 1,p g dg, we finally get 

<!>(/) = Viv-Wf) = f W^/G rf/i = I fg dg (12) 

JX Jx 

for every fe L p (g). //// 

6.17 Remark We have already encountered the special case p = q = 2 of 
Theorem 6.16. In fact, the proof of the general case was based on this special 
case, for we used the knowledge of the bounded linear functionals on l}(g) in 
the proof of the Radon-Nikodym theorem, and the latter was the key to the 
proof of Theorem 6.16. The special case p = 2, in turn, depended on the 
completeness of L 2 (p), on the fact that l}(g) is therefore a Hilbert space, and 
on the fact that the bounded linear functionals on a Hilbert space are given 
by inner products. 

We now turn to the complex version of Theorem 2.14. 


The Riesz Representation Theorem 

5.18 Let X be a locally compact Hausdorff space. Theorem 2.14 characterizes the 
positive linear functionals on C c (X). We are now in a position to characterize the 
bounded linear functionals <t> on C c (X). Since C c (X) is a dense subspace of C 0 (X), 
relative to the supremum norm, every such O has a unique extension to a 
bounded linear functional on C 0 (X). Hence we may as well assume to begin with 
that we are dealing with the Banach space C 0 (X). 

If g is a complex Borel measure, Theorem 6.12 asserts that there is a complex 
Borel function h with | h | = 1 such that dg = h d \ g \ . It is therefore reasonable to 
define integration with respect to a complex measure g by the formula 


J fdF = | ' fh d\n\. 

The relation J Xe dp = t l (F) is a special case of (1). Thus 


(1) 


[z. d(F + A) = (M + AXE) = m + m = 


Xe d H + Xe dk 
x JX 


( 2 ) 
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whenever p and X are complex measures on 901 and E e 901. This leads to the 
addition formula 

f/d^ + A)= (fdn+ [fdX, (3) 

%} x %)x %) x 

which is valid (for instance) for every bounded measurable f 

We shall call a complex Borel measure p on X regular if | p | is regular in the 
sense of Definition 2.15. If p is a complex Borel measure on X , it is clear that the 
mapping 

(4) 

is a bounded linear functional on C 0 (X\ whose norm is no larger than | p | (X). 
That all bounded linear functionals on C 0 (X) are obtained in this way is the 
content of the Riesz theorem: 

6.19 Theorem If X is a locally compact Hausdorff space , then every bounded 
linear functional O on C 0 (X) is represented by a unique regular complex Borel 
measure p, in the sense that 

c 

*/ = fdn (1) 

Jx 

for every f e C 0 (X). Moreover , the norm of O is the total variation of p : 

m = i /*!(*). (2) 

Proof We first settle the uniqueness question. Suppose p is a regular 
complex Borel measure on X and J/ dp = 0 for all / e C 0 (X). By Theorem 
6.12 there is a Borel function h, with | h \ = 1, such that dp = h d | p | . For any 
sequence {/„} in C 0 (X) we then have 

1^1(20= \{h-f n )hd\p\< (\h-f n \d\p\, (3) 

Jx Jx 

and since C c (X) is dense in l}( | p | ) (Theorem 3.14), {/„} can be so chosen that 
the last expression in (3) tends to 0 as n— ► oo. Thus | p \ (X) = 0, and p — 0. It 
is easy to see that the difference of two regular complex Borel measures on X 
is regular. This shows that at most one p corresponds to each O. 

Now consider a given bounded linear functional O on C 0 (X). Assume 
||0|| = 1, without loss of generality. We shall construct a positive linear func- 
tional A on C c (X\ such that 

|<D(/)I<A(I/I)< ll/ll (/eC c (J > ()), (4) 

where ||/|| denotes the supremum norm. 
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Once we have this A, we associate with it a positive Borel measure X 9 as 
in Theorem 2.14. The conclusion of Theorem 2.14 shows that X is regular if 
X(X) < oo. Since 


X(X) = sup {A/: 0 </< 1,/e C c (X)} 

and since | A/| < 1 if ||/|| < 1, we see that actually X(X) < 1. 
We also deduce from (4) that 


| <£(/) | < A( | / 1 ) = 




(fe C c (X)). 


( 5 ) 


The last norm refers to the space I}(X). Thus <I> is a linear functional on C c (X) 
of norm at most 1, with respect to the l}(X)-norm on C c (X). There is a norm- 
preserving extension of O to a linear functional on I}(X ), and therefore 
Theorem 6.16 (the case p = 1) gives a Borel function g 9 with \g \ < 1, such 
that 


<D(/) = !fg dl (feC c (X)). (6) 

Each side of (6) is a continuous functional on C 0 (X) 9 and C c (X) is dense in 
C 0 (AT). Hence (6) holds for all / e C 0 (X), and we obtain the representation (1) 
with dp = g dX. 

Since \\Q>\\ = 1,(6) shows that 

Jjffl dl > sup {| <Hf) | :/ e C 0 (X), ||/|| < 1} = 1. (7) 

We also know that X(X) < 1 and \ g\ < 1. These facts are compatible only if 
X{X) = 1 and \g \ = 1 a.e. [A]. Thus d \p \ = \ g\ dX — dX 9 by Theorem 6.13, 
and 


\l*\(X) = X(X) = 1 = ||0||, 


( 8 ) 


which proves (2). 

So all depends on finding a positive linear functional A that satisfies (4). 
life Cf (X) [the class of all nonnegative real members of C c (X)] 9 define 

A/= sup { | m | : he C c (X), \ h \ </}. (9) 

Then A/ > 0, A satisfies (4), 0 </, </ 2 implies A/ x < A f 2 , and A (c/) = cA/ if 
c is a positive constant. We have to show that 

A (/+ g) = A/+ Kg (/and g e C c + (X)), (10) 

and we then have to extend A to a linear functional on C c (X). 

Fix / and g e C*(X). If e > 0, there exist h 1 and h 2 e C c ( X) such that 
| hi | </, | h 2 1 < g, and 

Af^Wh^+e, Ag<\<Hh 2 )\ + e. 


(11) 
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There are complex numbers <x i9 | a £ | = 1, so that a f O(/i f ) = |0(h f )|, i = 1, 2. 
Then 


A/+ A0<|<D(/j 1 )| + |O(MI + 2e 

= + a 2 h 2 ) + 2e 

< A(|fc 1 | + |fc 2 |) + 2e 
^ A(/+ g) + 2c, 

so that the inequality > holds in (10). 

Next, choose h e C c (X), subject only to the condition \h\ </ + g, let 
V = {x:f(x) + g(x) > 0}, and define 


hi(x) 


fjxMx) 

fix) + g(x)’ 


h 2 (x) 


g(x)Mx) 

f(x) + g(x) 


h t (x) = h 2 (x) = 0 (x $ V). 


(x e V), 


( 12 ) 


It is clear that h Y is continuous at every point of V. If x 0 $ V 9 then h(x 0 ) = 0; 
since h is continuous and since I^MI ^ \K X )\ ^ or a U x e X, it follows that 
x 0 is a point of continuity of h v Thus h x e C c (X\ and the same holds for h 2 . 
Since h x + h 2 — h and | h Y \ <f 9 1 h 2 \ < g, we have 


| m I = I <D (h t ) + ®(fc 2 ) | < | <D (fci) | + | ®(fc 2 ) I ^ A/+ 


Hence A(/ H- gf) < A/ + Agf, and we have proved (10). 

If / is now a real function, / e C c (AT), then If + = \ f \ + /, so that / + e 
C c + (AT); likewise,/ - e Cf(X); and since f =f + — /“, it is natural to define 

A/= A/ + - A/ - (/e C c (AT), / real) (13) 


and 


A {u H- iu) = Aw H- iAi;. (14) 

Simple algebraic manipulations, just like those which occur in the proof of 
Theorem 1.32, show now that our extended functional A is linear on C c (X). 
This completes the proof. //// 


Exercises 

1 If n is a complex measure on a c-algebra 93t, and if E e 93t, define 

the supremum being taken over all finite partitions {£,•} of E. Does it follow that X = | jj. \ ? 

2 Prove that the example given at the end of Sec. 6.10 has the stated properties. 

3 Prove that the vector space M{X) of all complex regular Borel measures on a locally compact 
Hausdorff space X is a Banach space if \\fi\\ = |/u|(X). Hint: Compare Exercise 8, Chap. 5. [That the 
difference of any two members of M{X) is in M(X) was used in the first paragraph of the proof of 
Theorem 6.19; supply a proof of this fact.] 
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4 Suppose 1 < p < oo, and q is the exponent conjugate to p. Suppose p is a positive c-finite measure 
and g is a measurable function such that fg e l}{p) for every / e H(p). Prove that then g e 13{p). 

5 Suppose X consists of two points a and b; define p({a}) — 1, p({b}) = p{X) = oo, and p(0) = 0. Is it 
true, for this p, that E°(p) is the dual space of I}(p)*! 

6 Suppose 1 < p < oo and prove that I3(p) is the dual space of U(p) even if p is not cr-finite. (As usual, 

i Ip + l/q = l.) 

7 Suppose p is a complex Borel measure on [0, 2 n) (or on the unit circle T), and define the Fourier 
coefficients of p by 

fi(n) = | e"'"' dfi(t) (n = 0, ± 1, ±2, .. .). 

Assume that /)(«)—► 0 as n— ► + oo and prove that then £(«)—► 0 as n— ► — oo. 

Hint : The assumption also holds with / dp in place of dp if / is any trigonometric polynomial, 
hence if / is continuous, hence if / is any bounded Borel function, hence if dp is replaced by d \ p \ . 

8 In the terminology of Exercise 7, find all p such that p is periodic, with period k. [This means that 
p(n + k) = p(n) for all integers n; of course, k is also assumed to be an integer.] 

9 Suppose that {g n } is a sequence of positive continuous functions on / = [0, 1], that p is a positive 
Borel measure on /, and that 

(i) a„(x) = 0 a.e. [m], 

(ii) Jj g n dm = 1 for all n, 

(iii) lim„^ „ f, fg„ dm = \, f dg for every / 6 C(I). 

Does it follow that p 1 m? 

10 Let (X, 901, p) be a positive measure space. Call a set $ c l}(p) uniformly integrable if to each € > 0 
corresponds a S > 0 such that 

IJ/^j <« 

whenever / e < P and p{E) < 3. 

( a ) Prove that every finite subset of l}{p) is uniformly integrable. 

(b) Prove the following convergence theorem of Vitali : 

If (i) p{X) < oo, (ii) {/„} is uniformly integrable, (iii) /„(*)— ►/(*) a.e. as n— > oo, and (iv) |/(x) | 

< oo a.e., then f e l}(p) and 

lim f |/„-/| dp = 0. 

n-*ao JX 

Suggestion: Use Egoroff’s theorem. 

(c) Show that ( b ) fails if p is Lebesgue measure on ( — oo, oo), even if {H/JU} is assumed to be 
bounded. Hypothesis (i) can therefore not be omitted in ( b ). 

(d) Show that hypothesis (iv) is redundant in ( b ) for some p (for instance, for Lebesgue measure 
on a bounded interval), but that there are finite measures for which the omission of (iv) would make 
(b) false. 

{e) Show that Vitali’s theorem implies Lebesgue’s dominated convergence theorem, for finite 
measure spaces. Construct an example in which Vitali’s theorem applies although the hypotheses of 
Lebesgue’s theorem do not hold. 

(/) Construct a sequence {/„}, say on [0, 1], so that f n (x)—* 0 for every x, J /„— ► 0, but {/„} is not 
uniformly integrable (with respect to Lebesgue measure). 

(g) However, the following converse of Vitali’s theorem is true: 

If p{X) < oo ,/„ e L\p), and 

lim | f n dp 

n-* oo JE 

exists for every E e then {/„} is uniformly integrable. 
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Prove this by completing the following outline. 

Define p(A, B) = J | x A — Xb I dp. Then (901, p) is a complete metric space (modulo sets of measure 
0), and £-► J £ f n dp is continuous for each n. If € > 0, there exist E 0 , 6, N (Exercise 13, Chap. 5) so 
that 


II. 


(fn ~ f N ) dp 


< € if p(E, E 0 ) < 6 , n> N. 


n 


If p(A) < S, (*) holds with B = E 0 — A and C = E 0 u A in place of E. Thus (*) holds with A in place 
of E and 2c in place of €. Now apply (a) to {f l9 . . . , f N } : There exists S' > 0 such that 



<3c 


if p{A) < S', 


n — 1,2, 3,.... 


11 Suppose p is a positive measure on X, p(X) < oo ,/„ e I}(p) for n — 1, 2, 3, . . .,/„(x)— ► /(x) a.e., and 
there exists p > 1 and C < oo such that 1 /„ | p < C for all n. Prove that 

Hm [\f-f n \dp = 0. 

n-oo Jx 


Hint : {/„} is uniformly integrable. 

12 Let be the collection of all sets E in the unit interval [0, 1] such that either E or its complement 
is at most countable. Let p be the counting measure on this a-algebra 9W. If g(x) — x for 0 <; x <; 1, 
show that g is not 9K-measurable, although the mapping 


Xf(x) = j fg dn 

makes sense for every / e I}(p) and defines a bounded linear functional on l}(p). Thus (L 1 )* ^ L 00 in 
this situation. 

13 Let L°° = L®(m), where m is Lebesgue measure on / = [0, 1]. Show that there is a bounded linear 
functional A ^ 0 on L°° that is 0 on C(/), and that therefore there is no g e l}(m) that satisfies A f = 
fg dm for every / e L°°. Thus (L 00 )* ^ L 1 . 



CHAPTER 

SEVEN 

DIFFERENTIATION 


In elementary Calculus we learn that integration and differentiation are inverses 
of each other. This fundamental relation persists to a large extent in the context 
of the Lebesgue integral. We shall see that some of the most important facts 
about differentiation of integrals and integration of derivatives can be obtained 
with a minimum of effort by first studying derivatives of measures and the associ- 
ated maximal functions. The Radon-Nikodym theorem and the Lebesgue decom- 
position will play a prominent role. 


Derivatives of Measures 

We begin with a simple, theorem whose main purpose is to motivate the defini- 
tions that follow. 


7.1 Theorem Suppose p is a complex Bor el measure on R 1 and 

f(x) = p(( - oo, x)) (x e R 1 ). (1) 

If x e R 1 and A is a complex number , each of the following two statements 
implies the other: 

(a) f is differentiable at x and f\x) = A. 

(b) T o every 6 > 0 corresponds a S > 0 such that 


M) 

m(I) 


A 


< 6 


(2) 


135 
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for every open segment I that contains x and whose length is less than S. Here 
m denotes Lebesgue measure on R 1 . 


7.2 Definitions Theorem 7.1 suggests that one might define the derivative of 
p at x to be the limit of the quotients p(I)/m(I% as the segments / shrink to x, 
and that an analogous definition might be appropriate in several variables, 
i.e., in R k rather than in R 1 . 

Accordingly, let us fix a dimension k , denote the open ball with center 
x e R k and radius r > 0 by 

B(x , r) = {y e R k : \ y - x\ <r} (1) 

(the absolute value indicates the euclidean metric, as in Sec. 2.19), associate 
to any complex Borel measure p on R k the quotients 


(Grrtto = 


l*(B(x, r)) 
m(B(x , r)Y 


(2) 


where m = m k is Lebesgue measure on R k , and define the symmetric derivative 
of pat x to be 


(Dp)(x) = lim (Q r p)(x) (3) 

r~+ 0 

at those points x e R k at which this limit exists. 

We shall study Dp by means of the maximal function Mp. For p > 0, this 
is defined by 


(Mp)(x) = sup (Q r p\x\ (4) 

0 <r< oo 

and the maximal function of a complex Borel measure p is, by definition, that 
of its total variation | p | . 

The functions Mp: R k ^> [0, oo] are lower semicontinuous , hence measur- 
able. 

To see this, assume p > 0, pick X > 0, let E = {Mp > X}, and fix x e E. 
Then there is an r > 0 such that 


p{B(x, r)) = tm{B{x , r)) (5) 

for some t > X, and there is a S > 0 that satisfies 

(r + S) k < i*t/X. (6) 

If | y — x | < <5, then B(y, r + S) z> £(x, r), and therefore 

p(B(y , r + <5)) > tm(B(x, r)) = t[r/(r -h ^)] fc m(B(y, r + <5)) > Am(B(y, r + <5)). 

Thus B(x, c E. This proves that E is open. 

Our first objective is the “ maximal theorem ” 7.4. The following covering 
lemma will be used in its proof. 
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7.3 Lemma If W is the union of a finite collection of balls , r f ), 1 < i < N, 
then there is a set S c {1, . . . , N} so that 

(a) the balls , r t ) with i e S are disjoint , 
lb) W a (J B{x i9 3r^and 

(c) m(W)< 3* X r,)). 

i e S 


Proof Order the balls B { = B(x i9 r t ) so that r x > r 2 > • • • > r N . Put ^ = 1. 
Discard all Bj that intersect B lV Let B h be the first of the remaining B j9 if 
there are any. Discard all Bj with j > i 2 that intersect B i2 , let B h be the first 
of the remaining ones, and so on, as long as possible. This process stops after 
a finite number of steps and gives S = {i u i 2i . . 

It is clear that (a) holds. Every discarded Bj is a subset of B(x i9 3 r t ) for 
some i e 5, for if r f < r and B(x\ r') intersects B(x , r), then B(x\ r') c= B(x , 3r). 
This proves (h), and (c) follows from ( b ) because 

m(B(x 9 3 r)) = 3 r)) 

in HI I 

The following theorem says, roughly speaking, that the maximal function of 
a measure cannot be large on a large set. 

7.4 Theorem If \i is a complex Borel measure on R k and X is a positive number , 
then 

m{Mp > X) < 3 k X~ i \\p\\. (1) 

Here \\p\\ = \p\(R% and the left side of (1) is an abbreviation for the 
more cumbersome expression 

m({x e R k : (Mp)(x) > A}). (2) 

We shall often simplify notation in this way. 

Proof Fix p and X. Let K be a compact subset of the open set {Mp > X}. 
Each x e K is the center of an open ball B for which 

\p\(B)>Xm(B). 

Some finite collection of these B’s covers K , and Lemma 7.3 gives us a dis- 
joint subcollection, say {B u . . . , £„}, that satisfies 

m{K) <3 k t m(Bi) < 3 ‘l’ 1 £ | < 3 k r l M\. 

1 1 

The disjointness of {B u . . . , B n } was used in the last inequality. 

Now (1) follows by taking the supremum over all compact 

K c {Mp > X}. HU 
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7.5 Weak L 1 Iff e I}(R k ) and X > 0, then 

m { I/I > iii 

because, putting £ = { | / 1 > A}, we have 


-I' 


Xm(E) < J | / 1 dm < 


I/I dm = 




Accordingly, any measurable function / for which 


( 1 ) 

(2) 


X-m{\f\>X) (3) 

is a bounded function of X on (0, oo) is said to belong to weak L 1 . 

Thus weak L 1 contains L 1 . That it is actually larger is shown most simply by 
the function 1/x on (0, 1). 

We associate to each / e I}(R k ) its maximal function Mf: R k —> [0, oo], by 
setting 


(Mf)(x) = sup -f- f |/| dm. (4) 

0 <r < oo m \°r) JR(JC, r) 

[We wrote B r in place of B(x , r) because m(B(x 9 r)) depends only on the radius r.] 
If we identify / with the measure /i given by d\i = / dm, we see that (4) agrees with 
the previously defined M\i. Theorem 7.4 states therefore that the “maximal 
operator” M sends L 1 to weak L 1 , with a bound (namely 3 fc ) that depends only on 
the space R k : 

For every f e l}(R k ) and every X > 0, 

m{Mf>X}<3 k X- l \\f\\ 1 . (5) 


7.6 Lebesgue points Iff e 1}(R% any x e R k for which it is true that 

lim ~~ [ I f(y) -fix) | dm(y) = 0 (1) 

r-+ 0 jB(x,r ) 

is called a Lebesgue point of f 

For example, (1) holds if /is continuous at the point x. In general, (1) means 
that the averages of \ f— f(x)\ are small on small balls centered at x. The 
Lebesgue points of / are thus points where / does not oscillate too much, in an 
average sense. 

It is probably far from obvious that every / e L 1 has Lebesgue points. But the 
following remarkable theorem shows that they always exist. (See also Exercise 
23.) 


7.7 Theorem Iff e I}(R% then almost every x e R k is a Lebesgue point of f 
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Proof Define 


= f 1 f~f(x)\dm 

jB(x, r) 

(1) 

for xe^,r>0, and put 


(TfX x ) = lim sup (T r f)(x). 

r-0 

(2) 

We have to prove that T/= 0 a.e. [m]. 

Pick y > 0. Let n be a positive integer. By Theorem 3.14, 
g e C(R k ) so that \\f- g\\ x < 1/n. Put h =/- g. 

Since g is continuous, Tg = 0. Since 

there exists 

(T r h)(x) < ~ ~r f \h\dm + \h(x)\ 

m(B r ) J B(x , r) 

(3) 

we have 


Th < Mh + \h\. 

(4) 

Since T r f< T r g + T r h, it follows that 


Tf<Mh + \h\. 

(5) 

Therefore 


{Tf > 2y} c {Mh > y} u { | h \ > y}. 

(6) 

Denote the union on the right of (6) by E(y, n). Since 
Theorem 7.4 and the inequality 7.5(1) show that 

llfcll i < 1 In, 

m(E(y, n)) < (3* + l)/(yn). 

(7) 

The left side of (6) is independent of n. Hence 


{Tf>2y}cz f) E(y, ri). 

(8) 


n= 1 


This intersection has measure 0, by (7), so that {Tf > 2 y} is a subset of a set 
of measure 0. Since Lebesgue measure is complete, {Tf > 2 y) is Lebesgue 
measurable, and has measure 0. This holds for every positive y. Hence Tf = 0 
a e. [m]. //// 

Theorem 7.7 yields interesting information, with very little effort, about 
topics such as 

(a) differentiation of absolutely continuous measures, 

(b) differentiation using sets other than balls, 

(c) differentiation of indefinite integrals in R 1 , 

(d) metric density of measurable sets. 
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We shall now discuss these topics. 


7.8 Theorem Suppose p is a complex Borel measure on R k , and p m. Let f be 
the Radon-Nikodym derivative of p with respect to m. Then Dp = f a.e. [m], 
and 


H(E) = j* (Du) dm (1) 

for all Borel sets E a R k . 

In other words, the Radon-Nikodym derivative can also be obtained as a 
limit of the quotients Q r p. 


Proof The Radon-Nikodym theorem asserts that (1) holds with /in place of 
Dp. At any Lebesgue point x of f it follows that 


f(x) = lim 

r-o m(B r ) 


— r 

B r ) Jr(x, » 


f dm — lim 


p(B(x, r)) 


o m(£(x, r)) 


( 2 ) 


Thus (Dp)(x) exists and equals /(x) at every Lebesgue point of f hence a.e. 

Lml III! 


7.9 Nicely shrinking sets Suppose x e R k . A sequence {£j of Borel sets in R k is 
said to shrink to x nicely if there is a number a > 0 with the following property: 
There is a sequence of balls B(x, r f ), with lim r t = 0, such that E { c= B(x, r £ ) and 

m{E t ) > a • m(B(x, rf) (1) 


for i = 1, 2, 3, ... . 

Note that it is not required that x e E i9 nor even that x be in the closure of 
E t . Condition (1) is a quantitative version of the requirement that each E t must 
occupy a substantial portion of some spherical neighborhood of x. For example, 
a nested sequence of k-cells whose longest edge is at most 1,000 times as long as 
its shortest edge and whose diameter tends to 0 shrinks nicely. A nested sequence 
of rectangles (in R 2 ) whose edges have lengths 1/i and (1/i) 2 does not shrink 
nicely. 

7.10 Theorem Associate to each x e R k a sequence {E^x)} that shrinks to x 
nicely , and let f e l}(R k ). Then 

f(x) = lim 1 f f dm (1) 

i-.no jEi(x) 


at every Lebesgue point of f hence a.e. [m]. 



DIFFERENTIATION 141 


Proof Let x be a Lebesgue point of / and let a(x) and B(x , r £ ) be the positive 
number and the balls that are associated to the sequence {/^(x)}. Then, 
because £*(x) c= £(x, r £ ), 

f I/-/WI dm < — - f I f-f{x)\dm. 

MEM) Jem m(B(x, rj) J B(x>ri) 

The right side converges to 0 as i — > oo, because r f — > 0 and x is a Lebesgue 
point of /. Hence the left side converges to 0, and (1) follows. //// 

Note that no relation of any sort was assumed to exist between {E^x)} and 
{ Efy )}, for different points x and y. 

Note also that Theorem 7.10 leads to a correspondingly stronger form of 
Theorem 7.8. We omit the details. 


7.11 Theorem Iff e L l (R l ) and 

F(x) — I f dm (— oo < x < oo ), 

J— 00 

then F'(x) = /(x) at every Lebesgue point of f hence a.e. [m]. 

(This is the easy half of the fundamental theorem of Calculus, extended to 
Lebesgue integrals.) 

Proof Let {<5J be a sequence of positive numbers that converges to 0. 
Theorem 7.10, with E^x) = [x, x + <5J, shows then that the right-hand deriv- 
ative of F exists at all Lebesgue points of x of / and that it is equal to / (x) at 
these points. If we let E t (x) be [x — Si , x] instead, we obtain the same result 
for the left-hand derivative of F at x. //// 


7.12 Metric density Let E be a Lebesgue measurable subset of R k . The metric 
density of £ at a point x e R k is defined to be 


m(E n B(x, r )) 

lim — — — 

r _*o rn{B{x 9 r )) 


( 1 ) 


provided, of course, that this limit exists. 

If we let / be the characteristic function of E and apply Theorem 7.8 or 
Theorem 7.10, we see that the metric density of E is 1 at almost every point of £, 
and that it is 0 at almost every point of the complement of E. 

Here is a rather striking consequence of this, which should be compared with 
Exercise 8 in Chap. 2: 

If e > 0, there is no set E a R 1 that satisfies 


m(E n I) 
€ < nil) 


< 1 


6 


(2) 


for every segment /. 
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Having dealt with differentiation of absolutely continuous measures, we now 
turn to those that are singular with respect to m. 


7.13 Theorem Associate to each x e R k a sequence {£ f (x)} that shrinks to x 
nicely. If p is a complex Borel measure and pi. m, then 


lim^-0 

i- CO W(£;(X)) 


a.e. [m]. 


(i) 


Proof The Jordan decomposition theorem shows that it suffices to prove (1) 
under the additional assumption that p > 0. In that case, arguing as in the 
proof of Theorem 7.10, we have 

<x(x)p(Ei(x)) < p(Ej(x)) < p(B(x , r t )) 
m(Ei(x)) ~ m(B(x , rf) ~ m(B(x , rf) * 

Hence (1) is a consequence of the special case 

(Dp)(x) = 0 a.e. [m], (2) 

which will now be proved. 

The upper derivative Dp, defined by 

(D/i)(x) = lim T sup (£> r //)(x)l (x e R k ) (3) 

n~* oo LO <r< l/n J 

is a Borel function, because the quantity in brackets decreases as n increases 
and is, for each n, a lower semicontinuous function of x; the reasoning used 
in Sec. 7.2 proves this. 

Choose X > 0, 6 > 0. Since p _L m, p is concentrated on a set of Lebesgue 
measure 0. The regularity of p (Theorem 2.18) shows therefore that there is a 
compact set K, with m(K) = 0, p(K) > \\p\\ — e. 

Define Pi(E) = p(K n E), for any Borel set E a R k , and put p 2 = p — p^ 
Then \\p 2 \\ < e, and, for every x outside K, 


(Dn)(x) = (Dpi 2 X x ) < (Mfi 2 )(x). 

( 4 ) 

Hence 

{Dfi > A} <= K u {Mfi 2 > A}, 

( 5 ) 

and Theorem 7.4 shows that 

m{Dp >X}< 3 k X~ l \\p 2 \\ < 3*A _1 6. 

(6) 

Since (6) holds for every e > 0 and for every X > 0, 

we conclude that 

Dp = 0 a.e. [ni], i.e., that (2) holds. 

llll 


Theorems 7.10 and 7.13 can be combined in the following way: 
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7.14 Theorem Suppose that to each x e R k is associated some sequence {E t (x)} 
that shrinks to x nicely , and that p is a complex Borel measure on R k . 

Let dp=f dm + dp s be the Lebesgue decomposition of p with respect to m. 
Then 


lim 

i-* oo 


m(E t {x)) 


= /(*) 


a.e. [m]. 


In particular , p 1 m if and only if{Dp\x) = 0 a.e. [m]. 


The following result contrasts strongly with Theorem 7.13: 

7.15 Theorem If p is a positive Borel measure on R k and p _L m 9 then 

(DMx) = oo a.e. [>]. (1) 

Proof There is a Borel set S <=. R k with m(S) = 0 and /j(R k — S) = 0, and 

there are open sets Vj => S with m(Vj) < 1/j, for j = 1, 2, 3, 

For N = 1, 2, 3, ..., let E N be the set of all x e S to which correspond 
radii r t = r t (x), with lim r t = 0, such that 

r,)) < Nm(B(x, r,)). (2) 

Then (1) holds for every x e S — \J E N . 

N 

Fix N and j 9 for the moment. Every x e E N is then the center of a ball 
B x c= 1^. that satisfies (2). Let P x be the open ball with center x whose radius is 
1/3 of that of B x . The union of these balls P x is an open set W j N that con- 
tains E n and lies in Vj . We claim that 

p(W hN )<3 k N/f (3) 

To prove (3), let K a W j N be compact. Finitely many P x cover K. 
Lemma 7.3 shows therefore that there is a finite set F c= E N with the follow- 
ing properties: 

(a) {P x - x e F} is a disjoint collection, and 

(b) K c: (J B x . 

xeF 

Thus 

KK) < Z H(B X ) <N Z m(B x ) 

xeF xeF 

= 3 k N Z "#*) < 3 k Nm(Vj) < 3 k N/j. 

xeF 

This proves (3). 

Now put Cl N = f) W j N . Then E N <=£i N , C1 N is a G } , n(Cl N ) = 0, and 

j 

(E>ntx) = oo at every point of S — JJ Q N . I III 

N 
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The Fundamental Theorem of Calculus 

7.16 This theorem concerns functions defined on some compact interval [a, b] in 
R 1 . It has two parts. The first asserts, roughly speaking, that the derivative of the 
indefinite integral of a function is that same function. We dealt with this in 
Theorem 7.11. The second part goes the other way: one returns to the original 
function by integrating its derivative. More precisely 

f(x) -f(a) = jV(t) dt (a < x < b). (1) 

In the elementary version of this theorem, one assumes that / is differentiable at 
every point of [a, b] and that /' is a continuous function. The proof of (1) is then 
easy. 

In trying to extend (1) to the setting of the Lebesgue integral, questions such 
as the following come up naturally: 

Is it enough to assume that /' e L 1 , rather than that /' is continuous? 

If / is continuous and differentiable at almost all points of [a, b], must (1) 
then hold? 

Before proving any positive results, here are two examples that show how (1) 
can fail. 

(a) Put f(x) = x 2 sin (x -2 ) if x ^ 0, /( 0) = 0. Then / is differentiable at every 
point, but 


r 


\f'(t) \ dt = oo, 


(2) 


so/' * L 1 . 

If we interpret the integral in (1) (with [0, 1] in place of [ a , b]) as the 
limit, as €— ► 0, of the integrals over [e, 1], then (1) still holds for this /. 

More complicated situations can arise where this kind of passage to the limit 
is of no use. There are integration processes, due to Denjoy and Perron (see [18], 
[28]), which are so designed that (1) holds whenever / is differentiable at every 
point. These fail to have the property that the integrability of / implies that of 
| / 1 , and therefore do not play such an important role in analysis. 

(b) Suppose / is continuous on [a, b], / is differentiable at almost every point of 
[ a , b], and /' e L 1 on [a, b]. Do these assumptions imply that (1) holds? 
Answer : No. 

Choose {<5„} so that 1 = S 0 > > S 2 > • • *, <5„— ► 0. Put E 0 = [0, 1], 

Suppose n > 0 and E n is constructed so that E n is the union of 2” disjoint 
closed intervals, each of length 2 ~ n S n . Delete a segment in the center of each 
of these 2 n intervals, so that each of the remaining 2 W+1 intervals has length 



DIFFERENTIATION 145 


2 " l 3 n + 1 (this is possible, since 3 n + 1 < 3 n ), and let E n + l be the union of 
these 2 n + 1 intervals. Then E t => E 2 => * * * , m(£„) = <5„ , and if 


E=f]E n , (3) 

> 1=1 

then E is compact and m(E) = 0. (In fact, E is perfect.) Put 

9n = <5„' ‘xe. and /„(x) = 3 „(f) dt (n = 0, 1, 2, . . .)• (4) 

Then /„( 0) = 0,/„(l) = 1, and each f n is a monotonic function which is con- 
stant on each segment in the complement of E n . If / is one of the 2 n intervals 
whose union is E n , then 

^g n (t)dt=[g n+1 (t)dt = 2-". (5) 

It follows from (5) that 

/n+lM = /»(*) (**£„) (6) 


and that 


!/»(*) -/»+i(x) I ^ 


-3n+ll < 2' 


(x e £„). 


(7) 


Hence {/„} converges uniformly to a continuous monotonic function f with 
/( 0) = 0,/(l) = 1, and f'(x) = 0 for all x $ E. Since m(E) = 0, we have f = 0 
a.e. 

Thus (1) fails. 

If 3 n = (2/3) w , the set E is Cantor’s “ middle thirds ” set. 

Having seen what can go wrong, assume now that /' e L 1 and that (1) does 
hold. There is then a measure \i, defined by d\i — /' dm. Since \i m, 
Theorem 6.11 shows that there corresponds to each €>0a<5>0so that 
| \i | (E) < e whenever E is a union of disjoint segments whose total length is less 
than 3. Since f{y) — fix) = /x((x, y)) if a < x < y < b, it follows that the absolute 
continuity of /, as defined below, is necessary for (1). Theorem 7.20 will show that 
this necessary condition is also sufficient. 


7.17 Definition A complex function /, defined on an interval / = [ a , b], is 
said to be absolutely continuous on / (briefly, / is AC on I) if there corre- 
sponds to every e>0a<5>0so that 

i \m-n*d\<* 

i= 1 


(i) 
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for any n and any disjoint collection of segments (a l9 ff), (a„, /„) in / 
whose lengths satisfy 


I (Pi - <*,) < 6. (2) 

i = 1 

Such an /is obviously continuous: simply take n = 1. 

In the following theorem, the implication (b)— >(c) is probably the most 
interesting. That (a)— >(c) without assuming monotonicity of /is the content 
of Theorem 7.20. 


7.18 Theorem Let I = [a, b], /et /: /— ► K 1 he continuous and nondecreasing. 
Each of the following three statements about f implies the other two: 

(a) f is AC on /. 

(b) f maps sets of measure 0 to sets of measure 0. 

(c) / is differentiable a.e. on I,f' e L 1 , and 


f(x) —f(a) = 


f\t) dt 


(a < x < b). 


(1) 


Note that the functions constructed in Example 7.16(h) map certain compact 
sets of measure 0 onto the whole unit interval ! 

Exercise 12 complements this theorem. 

Proof We will show that (a )— > (h)— ► (c)—> (a). 

Let $01 denote the a-algebra of all Lebesgue measurable subsets of R 1 . 
Assume / is AC on /, pick £c/so that E e $0? and m(E) = 0. We have to 
show that f(E) e $01 and m(f(E)) = 0. Without loss of generality, assume that 
neither a nor h lie in E. 

Choose 6 > 0. Associate S > 0 to / and 6, as in Definition 7.17. There is 
then an open set V with m(V) < S , so that E <= V a /. Let (a f , be the 
disjoint segments whose union is V. Then £ — a,) < S , and our choice of S 

shows that therefore 


I (f(Pi) -/(«J) < (2) 

i 

[Definition 7.17 was stated in terms of finite sums; thus (2) holds for every 
partial sum of the (possibly) infinite series, hence (2) holds also for the sum of 
the whole series, as stated.] 

Since E cz V,f(E) cz (J [/(a f ), /(&)]. The Lebesgue measure of this union 
is the left side of (2). This says that / (E) is a subset of Borel sets of arbitrarily 
small measure. Since Lebesgue measure is complete, it follows that f(E) e $01 
and m(f (E)) = 0. 

We have now proved that (a) implies (b). 
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Assume next that (b) holds. Define 

g(x) = x + /(x) (a < x < b). (3) 

If the /-image of some segment of length rj has length rj\ then the 0-image of 
that same segment has length rj + rj f . From this it follows easily that g 
satisfies (fr), since / does. 

Now suppose E c /, E e 901. Then E = E^ u E 0 where m(E 0 ) = 0 and E l 
is an F a (Theorem 2.20). Thus E t is a countable union of compact sets, and 
so is g(E t ), because g is continuous. Since g satisfies ( b ), m(g(E 0 )) = 0. Since 
g(E) = g(E t ) u g(E 0 ), we conclude: g(E) e 9TC. 

Therefore we can define 

fi(E) = m(g(E)) (E c /, E e W). (4) 

Since g is one-to-one (this is our reason for working with g rather than /), 
disjoint sets in / have disjoint 0-images. The countable additivity of m shows 
therefore that \i is a (positive, bounded) measure on $01. Also, \i m, because 
g satisfies (b). Thus 


d\i — h dm (5) 

for some h e l}(m\ by the Radon-Nikodym theorem. 

If E = [a, x], then g(E) = [g{a), 0(x)], and (5) gives 

g(x) - g(a) = m(g{E)) = g(E) = \h dm = h(t) dt. 

JE Ja 

If we now use (3), we conclude that 

/(x) -/(a) = £ [h(t) - 1] dt (a < x < b). (6) 

Thus f\x) — h(x) — 1 a.e. [m], by Theorem 7.11. 

We have now proved that (b) implies (c). 

The discussion that preceded Definition 7.17 showed that (c) implies (a). 

mi 


7.19 Theorem Suppose f: /— ► R 1 is AC, / = [a, b]. Define 

F(x) = sup X I f(ti) 1) I (a < x < b) (1) 

i= 1 

where the supremum is taken over all N and over all choices of (tj such that 

a = t 0 < ti < • • • < t N = x. (2) 


The functions F, F + f F — / are then nondecreasing and AC on I. 
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[F is called the total variation function of /. If /is any (complex) function on /, 
AC or not, and F(b) < oo, then / is said to have bounded variation on /, and F(b) 
is the total variation of / on /. Exercise 13 is relevant to this.] 


Proof If (2) holds and x < y <b, then 

m > i m -m i + £ i m ,) i . o) 

i = 1 

Hence F(y) > \f(y) — f(x)\ + F(x). In particular 

F(y) >f(y) -f(x) + F(x) and F(y) >/(*) -/(y) + F(x). (4) 

This proves that F, F + / F — / are nondecreasing. 

Since sums of two AC functions are obviously AC, it only remains to be 
proved that F is AC on /. 

If (a, P) cz I then 

m - F( a) = sup £ I m -/(r,_x) | , (5) 

1 

the supremum being taken over all {tj that satisfy a = t 0 < • • • < t H = fi. 

Note that £ x ) = ft — a. 

Now pick e > 0, associate d > 0 to f and e as in Definition 7.17, choose 
disjoint segments (a,-, /?,•) c / with £ (// — a < <5, and apply (5) to each 
(a 7 -, fij). It follows that 

Z (F(pj) - F(dj)) < e, (6) 

j 

by our choice of 3. Thus F is AC on /. //// 

We have now reached our main objective : 


7.20 Theorem If f is a complex function that is AC on I = [ a , b], then f is 
differentiable at almost all points of Iff' e l}(m ), and 


fix) -fia) = 


fit) dt 


Ja 


(a < x < b). 


( 1 ) 


Proof It is of course enough to prove this for real /. Let F be its total 
variation function, as in Theorem 7.19, define 

fi=&F+f), f 2 = -/), (2) 

and apply the implication (a)— > (c) of Theorem 7.18 to j\ and f 2 . Since 

/=/i ~f 2 


this yields (1). 


( 3 ) 

llll 
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The next theorem derives (1) from a different set of hypotheses, by an entirely 
different method of proof. 

7.21 Theorem Iff : [a, ft ]— > R 1 is differentiable at every point of [a, b] and 
/' e l } on [a, ft], t/ien 

/(*) -/(a) = JV(0 dt (a < x < b). (1) 

Note that differentiability is assumed to hold at point of [a, ft]. 

Proof It is clear that it is enough to prove this for x = b. Fix e > 0. 
Theorem 2.25 ensures the existence of a lower semicontinuous function g on 
[a, b] such that g > /' and 

f g{t) dt < j* /'(t) df + e. (2) 

Ja Ja 

Actually, Theorem 2.25 only gives g > /', but since m([a, h]) < oo, we can 
add a small constant to g without affecting (2). For any rj > 0, define 

F n( x ) = £ 9(t) dt -f(x) +f(a) + t](x - a) (a < x < b). (3) 

Keep rj fixed for the moment. To each x e [a, b) there corresponds a d x > 0 
such that 


g(t) >/'(*) and ^ ^ </'(*) + n 


( 4 ) 


for all t e (x, x + 5 X ), since g is lower semicontinuous and g(x) > f\x). For 
any such t we therefore have 

FJLt) - F,(X) = |V(s) ds - [/(f) -/(x)] + n(t - X) 

> (f - x)/'(x) - (t - x)[/'(x) + >/] + f/(f - x) = 0. 

Since F^a) = 0 and F n is continuous , there is a last point x e [a, h] at which 
F^x) = 0. If x < b, the preceding computation implies that F^t) > 0 for 
t e (x, ft]. In any case, F^b) > 0. Since this holds for every rj > 0, (2) and (3) 
now give 


m —f(a) < 


g(t) dt < f f\t) dt + e, 
Ja Ja 


( 5 ) 


and since e was arbitrary, we conclude that 
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If / satisfies the hypotheses of the theorem, so does — /; therefore (6) 
holds with —fin place of f and these two inequalities together give (1). I HI 


Differentiable Transformations 


7.22 Definitions Suppose V is an open set in R k , T maps V into R k , and 
x e V. If there exists a linear operator A on R k (i.e., a linear mapping of R k 
into R k , as in Definition 2.1) such that 


lim 

h ->0 


\T(x + h)-T(x)-Ah\ 

\h\ 


= 0 


(1) 


(where, of course, h e R k ), then we say that T is differentiable at x, and define 


T\x) = A. (2) 

The linear operator T'(x) is called the derivative of T at x. (One shows 
easily that there is at most one linear A that satisfies the preceding require- 
ments; thus it is legitimate to talk about the derivative of T.) The term differ- 
ential is also often used for T'(x). 

The point of (1) is of course that the difference T(x + h) — T(x) is 
approximated by T'(x)h , a linear function of h. 

Since every real number a gives rise to a linear operator on R 1 (mapping 
h to ah), our definition of T'(x) coincides with the usual one when k = 1. 

When A: R k — > R k is linear, Theorem 2.20(e) shows that there is a number 
A(4) such that 


m(A(E)) = A(A)m(E) (3) 

for all measurable sets E a R k . Since 


A\x) = A (x e R k ) (4) 

and since every differentiable transformation T can be locally approximated 
by a constant plus a linear transformation, one may conjecture that 


m(T(E)) 

m(E) 


- A (T(x)) 


( 5 ) 


for suitable sets E that are close to x. This will be proved in Theorem 7.24, 
and furnishes the motivation for Theorem 7.26. 

Recall that A(4) = | det A | was proved in Sec. 2.23. When T is differen- 
tiable at x, the determinant of T'(x) is called the Jacobian of T at x, and is 
denoted by J T (x). Thus 


A(r(x)) = |J r (x)|. (6) 

The following lemma seems geometrically obvious. Its proof depends on the 
Brouwer fixed point theorem. One can avoid the use of this theorem by imposing 
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stronger hypotheses on F, for example, by assuming that F is an open mapping. 
But this would lead to unnecessarily strong assumptions in Theorem 7.26. 

7.23 Lemma Let S = {x: |x| = 1} be the sphere in R k that is the boundary of 
the open unit ball B = B( 0, 1). 

// F: B— > R k is continuous , 0 < e < 1, and 

| F(x) - x | < e (1) 

for all x e S, then F(B) z> B( 0, 1 — e). 

Proof Assume, to reach a contradiction, that some point a e B( 0, 1 — e) is 
not in F(B). By (1), | F(x) | > 1 — e if x e S. Thus a is not in F(S), and there- 
fore a F(x), for every x e B. This enables us to define a continuous map 
G: B— > B by 


G(x) = 


a — F(x) 
\a-F(x)\' 


If x e S, then x • x = | x | 2 = 1 , so that 


(2) 


x • (a — F(x)) = x • a + x • (x — F(x)) — 1 < | a | + e — 1 < 0. (3) 

This shows that x • G(x) < 0, hence x ^ G(x). 

If x e B, then obviously x ^ G(x), simply because G(x) e S. 

Thus G fixes no point of 5, contrary to Brouwer’s theorem which states 
that every continuous map of B into B has at least one fixed point. //// 


A proof of Brouwer’s theorem that is both elementary and simple may be 
found on pp. 38-40 of “Dimension Theory” by Hurewicz and Wallman, 
Princeton University Press, 1948. 


7.24 Theorem If 

(a) V is open in R k , 

(b) T : V — > R k is continuous , and 

(c) T is differentiable at some point x e V, then 


lim 

r-0 


m(T(B(x, r))) 
m(B(x , r)) 


= A(T'(x)). 


( 1 ) 


Note that T(B(x , r)) is Lebesgue measurable; in fact, it is a-compact, because 
£(x, r) is ^-compact and T is continuous. 


Proof Assume, without loss of generality, that x = 0 and T(x) = 0. Put 
A = T\ 0). 

The following elementary fact about linear operators on finite- 
dimensional vector spaces will be used: A linear operator A on R k is one-to- 
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one if and only if the range of A is all of R k . In that case, the inverse A ~ 1 of A 
is also linear. 

Accordingly, we split the proof into two cases. 


Case i A is one-to-one. Define 


F(x) = A~ 1 T(x) (xeV). (2) 

Then F'( 0) = A~ l T'(0) = A -1 A = /, the identity operator. We shall prove 
that 


m(F(B(0, r))) _ 
,~o m(B(0,r)) 


( 3 ) 


Since T(x) = AF(x ), we have 

m(T(B)) = m(A(F(B))) = A (A)m(F(B)) (4) 

for every ball B , by 7.22(3). Hence (3) will give the desired result. 

Choose e > 0. Since F( 0) = 0 and F'( 0) = /, there exists a 8 > 0 such that 
0 < | x | < S implies 


| F(x) — x | < e | x | . 


( 5 ) 


We claim that the inclusions 


B( 0, (1 - e)r) c F(B( 0, r)) c z B( 0, (1 + e)r) (6) 

hold if 0 < r < d. The first of these follows from Lemma 7.23, applied to 
B( 0, r) in place of B( 0, 1), because | F(x) — x | < er for all x with | x | = r. The 
second follows directly from (5), since | F(x) | < (1 + e) | x | . It is clear that (6) 
implies 


(1 - ef < 


m(F(g(0, r))) 
m(B(0, r)) 


< (1 + e)* 


( 7 ) 


and this proves (3). 

Case ii A is not one-to-one. In this case, A maps R k into a subspace of lower 
dimension, i.e., into a set of measure 0. Given e > 0, there is therefore an 
rj > 0 such that m(E J < e if E n is the set, of all points in R k whose distance 
from A(B( 0, 1)) is less than rj. Since A = T'( 0), there is a d > 0 such that 
| x | < 5 implies 


| T(x) — Ax | < rj | x | . (8) 

If r < d, then T(B( 0, r)) lies therefore in the set E that consists of the points 
whose distance from A(B( 0, r)) is less than rjr. Our choice of rj shows that 
m(E) < er k . Hence 


m(T(B( 0, r))) <er k (0 < r < d). 


( 9 ) 
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Since r k = m(B( 0, r))/m(B( 0, 1)), (9) implies that 

ntnm r))) _ n 

r-*o rn(B(0, r)) 

This proves (1), since A(T'(0)) = A(,4) = 0. 


7.25 Lemma Suppose E cz R k , m(F) = 0, T maps E into R k , and 

|T(y)-T(x)| 

lim sup — — < oo 

\y-x\ 

for every x e F, as y tends to x within E. 

Then m(T(E)) = 0. 


( 10 ) 

//// 


Proof Fix positive integers n and p, let F = F n p be the set of all x e E such 
that 


I T(y) — T(x) | < n \ y — x\ 

for all y e B(x , 1/p) n E, and choose e > 0. Since m(F) = 0, F can be covered 
by balls B ( = B(x i9 r f ), where x t e F, r t < 1/p, in such a way that m(F I ) < 6. 
(To do this, cover F by an open set W of small measure, decompose W into 
disjoint boxes of small diameter, as in Sec. 2.19, and cover each of those that 
intersect F by a ball whose center lies in the box and in F.) 

If x e F n B { then | x { — x \ < r { < 1/p and x f e F. Hence 

I T(x t ) - T(x)| < n\xt - x| < nr; 

so that T(F n B t ) a B(T(Xi), nr/). Therefore 

T(F) <= U B(T( Xi ), nr,). 

i 

The measure of this union is at most 

X m(B(T( Xi ), nr,) = n k £ m(B,) < n k e. 

i i 

Since Lebesgue measure is complete and e was arbitrary, it follows that T(F) 
is measurable and m(T(F)) = 0. 

To complete the proof, note that E is the union of the countable collec- 

ti0n { F n.p}- llll 

Here is a special case of the lemma: 

If V is open in R k and T : V — > R k is differentiable at every point of V, then T 
maps sets of measure 0 to sets of measure 0. 

We now come to the change-of-variables theorem. 

7.26 Theorem Suppose that 
(i) X a V a R k , V is open , T: V — ► R k is continuous ; 



154 REAL AND COMPLEX ANALYSIS 


(ii) X is Lebesgue measurable , T is one-to-one on X , and T is differentiable at 
every point of X ; 

(iii) m(T(V - X)) = 0. 

Then , setting Y = T(2Q, 

J7dm= £(/<> T)|J r Mm (1) 

/or euery measurable /: ► [0, oo]. 

The case X = K is perhaps the most interesting one. As regards condition 
(iii), it holds, for instance, when m(V — X) = 0 and T satisfies the hypotheses of 
Lemma 7.25 on V — X. 

The proof has some elements in common with that of the implication 
(fr)— ► (c) in Theorem 7.18. 

It will be important in this proof to distinguish between Borel sets and 
Lebesgue measurable sets. The o-algebra consisting of the Lebesgue measurable 
subsets of R k will be denoted by $01. 

Proof We break the proof into the following three steps: 

(I) If E e $01 and E aV , then T(E) e $01. 

(II) For every E e $01, 

/• 

m(T(E n X)) = X eUt\ dm. 

JX 

(III) For every A e $01, 

f Xa dm = (x A ° T)\J T \dm. 

JY JX 

If E 0 e $01, E 0 a V , and m(E 0 ) = 0, then m(T(E 0 — X)) = 0 by (iii), and 
m(T(E 0 n X)) = 0 by Lemma 7.25. Thus m(T(E 0 )) = 0. 

If E l a V is an F ff , then E t is a-compact, hence T(E t ) is a-compact, 
because T is continuous. Thus T(E t ) e $01. 

Since every E e $01 is the union of an F a and a set of measure 0 
(Theorem 2.20), (I) is proved. 

To prove (II), let n be a positive integer, and put 

V n = {xe V:\T(x)\<n}, X n = Xn V n . (2) 

Because of (I), we can define 

Mn (E) = m(T(E n X n )) (E e $01). (3) 

Since T is one-to-one on X n , the countable additivity of m shows that p n is a 
measure on $01. Also, p n is bounded (this was the reason for replacing X 
temporarily by X n ), and \i n m, by another application of Lemma 7.25. 
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Theorem 7.8 tells us therefore that (Dp n )(x) exists a.e. [m], that Dp n e L}(m), 
and that 


ii n {E) = J (Dp n ) dm (E e Wl). (4) 

We claim next that 

(Dp n )(x) = \J T (x)\ (xeX n ). (5) 

To see this, fix x e X n , and note that B(x , r) a V n for all sufficiently small 
r > 0, because V„ is open. Since V„ — X„ a V — X, hypothesis (iii) enables us 
to replace X n by V n in (3) without changing p n (E). Hence, for small r > 0, 

li„(B(x, r)) = m(T(B(x, r))). (6) 

If we divide both sides of (6) by m(B(x , r)) and refer to Theorem 7.24 and 
formula 7.22(6), we obtain (5). 

Since (3) implies that p n {E) = p n (E n X n ), it follows from (3), (4), and (5) 

that 

m(T(E n X„)) = £ I Jr I dm (E e «R). (7) 

If we apply the monotone convergence theorem to (7), letting n— ► oo, we 
obtain (II). 

We begin the proof of (III) by letting A be a Borel set in R k . Put 

E = T~\A) = {xeV: T(x) e A}. (8) 

Then Xe = Xa 0 T. Since Xa j s a Borel function and T is continuous, Xe i s a 
Borel function (Theorem 1.12), hence E e SUl. Also 

T(E n X) — A n Y (9) 

which implies, by (II), that 

jxA dm = m(T(E n X)) = jh A » T) \J T \ dm. (10) 

Finally, if N eWl and m(N) = 0, there is a Borel set A zd N with 
m(A) = 0. For this A, (10) shows that (x A ° T)| J r | = 0 a.e. [m]. Since 0 < 
Xn ^ Xa ^ it follows that both integrals in (10) are 0 if A is replaced by N. 
Since every Lebesgue measurable set is the disjoint union of a Borel set and a 
set of measure 0, (10) holds for every A e This proves (III). 

Once we have (III), it is clear that (1) holds for every nonnegative 
Lebesgue measurable simple function /. Another application of the monotone 
convergence theorem completes the proof. //// 
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Note that we did not prove that / ° T is Lebesgue measurable for all 
Lebesgue measurable /. It need not be; see Exercise 8. What the proof does estab- 
lish is the Lebesgue measurability of the product (/ ° T)\J t \. 

Here is a special case of the theorem: 

Suppose (p : [a, h] — ► [a, /?] is AC, monotonic , cp(a) = a, cp(b) = /?, and f > 0 is 
Lebesgue measurable. Then 

f fit) dt = f f(<p(x))<p\x) dx. (15) 

Ja Ja 

To derive this from Theorem 7.26, put V = (a, b), T = <p, let Q be the union 
of the maximal segments on which cp is constant (if there are any) and let X be 
the set of all x e V — Q where cp\x) exists (and is finite). 

Exercises 

1 Show that | f{x) \ < (. Mf\x ) at every Lebesgue point of/if/e l}(R k ). 

2 For S > 0, let I(S) be the segment ( — <5, <5) c: R l . Given a and /?, 0 < a < /? < 1, construct a measur- 
able set E c: R l so that the upper and lower limits of 

m(E n I(S)) 

2 S 


are /? and a, respectively, as S — > 0. 

(Compare this with Section 7.12.) 

3 Suppose that E is a measurable set of real numbers with arbitrarily small periods. Explicitly, this 
means that there are positive numbers p, , converging to 0 as i— > oo, so that 

£ + p f = £ (i= 1,2,3,...). 

Prove that then either E or its complement has measure 0. 

Hint: Pick a e R 1 , put F{x ) = m{E n [a, x]) for x > a, show that 


F(x + - F(x - = F{y + p^ - F(y - Pi ) 


if a + Pi < x < y. What does this imply about F\x) if m(E) >0? 

4 Call t a period of the function /on R 1 if/(x + t) =/(x) for all x e R 1 . Suppose / is a real Lebesgue 
measurable function with periods s and t whose quotient s/t is irrational. Prove that there is a con- 
stant c such that /(x) = c a.e., but that / need not be constant. 

Hint : Apply Exercise 3 to the sets {/> X). 

5 If A c: R 1 and B a R 1 , define A + B = {a + b: a e A, b e B}. Suppose m(A) > 0, m(B) > 0. Prove 
that A + B contains a segment, by completing the following outline. 

There are points a 0 and b 0 where A and B have metric density 1. Choose a small S > 0. Put 
c 0 = a 0 + b 0 . For each €, positive or negative, define B ( to be the set of all c 0 + € — b for which b e B 
and | b — b 0 \ < S. Then B ( a (a 0 + € — <5, a 0 + € 4- <5). If S was well chosen and | e \ is sufficiently 
small, it follows that A intersects B { , so that A + B (c 0 — € 0 , c 0 + € 0 ) for some e 0 > 0. 

Let C be Cantor’s “ middle thirds ” set and show that C + C is an interval, although m(C) = 0. 
(See also Exercise 19, Chap. 9.) 

6 Suppose G is a subgroup of R 1 (relative to addition), G ^ R 1 , and G is Lebesgue measurable. Prove 
that then m(G) = 0. 

Hint : Use Exercise 5. 
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7 Construct a continuous monotonic function / on R 1 so that / is not constant on any segment 
although f\x ) = 0 a.e. 

8 Let V = (a, b) be a bounded segment in R 1 . Choose segments W n c: V in such a way that their 
union W is dense in V and the set K = V — W has m(K) > 0. Choose continuous functions <p n so that 
(p n (x) = 0 outside W n , 0 < <p n {x) < 2~ n in W n . Put (p = £ <p n and define 

T(x) = J* q>(t) dt (a < x < b). 

Prove the following statements : 

(a) T satisfies the hypotheses of Theorem 7.26, with X = V. 

(b) T is continuous, T'(x) = 0 on K t m(T(K)) = 0. 

(c) If £ is a nonmeasurable subset of K (see Theorem 2.22) and A = T(£), then Xa * s Lebesgue 
measurable but Xa° T is not. 

(i d ) The functions (p n can be so chosen that the resulting T is an infinitely differentiable homeo- 
morphism of V onto some segment in R l and (c) still holds. 

9 Suppose 0 < a < 1. Pick t so that t* = 2. Then t > 2, and the construction of Example ( b ) in 
Sec. 7.16 can be carried out with S n = (2 /t) n . Show that the resulting function /belongs to Lip a on 
[0, 1]. 

10 If/e Lip 1 on [a, b], prove that /is absolutely continuous and that /' e L 00 . 

11 Assume that 1 < p < oo,/is absolutely continuous on [a, b],f e If, and a = 1/q, where q is the 
exponent conjugate to p. Prove that / e Lip a. 

12 Suppose (p : [a, 6]— * R 1 is nondecreasing. 

(i a ) Show that there is a left-continuous nondecreasing / on [a, 6] so that {/#<?} is at most 
countable. [Left-continuous means: if a<x<b and e > 0, then there is a <5 > 0 so that 
l/( x ) “ /(* — 01 < € whenever 0 < t < <5.] 

(b) Imitate the proof of Theorem 7.18 to show that there is a positive Borel measure p on [u, b~\ 
for which 


/(*) -/(a) = Ml>> *)) (a < x < 6). 

(c) Deduce from ( b ) that f\x) exists a.e. [m], that /' e L‘(m), and that 

/(x) —/(a) = /'(f) dt + s(x) (a < x < b) 

where s is nondecreasing and s'(x) = 0 a.e. [m]. 

(i d) Show that p 1 m if and only if f\x) = 0 a.e. [m], and that p <m if and only if /is AC on 
fabl 

(e) Prove that (p'{x) = f\x) a.e. [m]. 

13 Let BV be the class of all / on [a, 6] that have bounded variation on [a, b], as defined after 
Theorem 7.19. Prove the following statements. 

(a) Every monotonic bounded function on [a, 6] is in BV. 

(b) If f e BV is real, there exist bounded monotonic functions f t and f 2 so that f =f t — f 2 • 

Hint : Imitate the proof of Theorem 7.19. 

(c) If f e BV is left-continuous then f t and f 2 can be chosen in ( b ) so as to be also left- 
continuous. 

(d) If/ e BV is left-continuous then there is a Borel measure p on [a, 6] that satisfies 

/(X) —f(a) = #i(0> x)) (a < X ^ b)\ 

p <^m if and only if /is AC on [a, b]. 

{e) Every /e BV is differentiable a.e. [m], and /' e I}{m). 

14 Show that the product of two absolutely continuous functions on [a, 6] is absolutely continuous. 
Use this to derive a theorem about integration by parts. 
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15 Construct a monotonic function /on R 1 so that fix ) exists (finitely) for every x e R 1 , but /' is not 
a continuous function. 

16 Suppose E c [a, b~\, m(E) = 0. Construct an absolutely continuous monotonic function / on [a, b~\ 
so that f\x ) = oo at every x e E. 

Hint : E a f) V n , V„ open, m(V n ) a 2~ n . Consider the sum of the characteristic functions of these 

sets. 

17 Suppose {fi n } is a sequence of positive Borel measures on R k and 


m = £ fijE). 

n= 1 

Assume p{R k ) < oo. Show that p is a Borel measure. What is the relation between the Lebesgue 
decompositions of the p n and that of pi 
Prove that 


(£Wto = £ (£>/*„X*) a.e. [m], 

n=l 

Derive corresponding theorems for sequences {/„} of positive nondecreasing functions on R 1 and 
their sums /=£/,. 

18 Let (p 0 (t) = 1 on [0, 1), <p Q {t ) = — 1 on [1, 2), extend (p 0 to R 1 so as to have period 2, and define 
<P n (t) = <PoV n t)> n = 1,2,3,.... 

Assume that ]T | c„ | 2 < oo and prove that the series 

i c„<p„w c ) 

n- 1 


converges then for almost every t. 

Probabilistic interpretation: The series £ (±c„) converges with probability 1. 

Suggestion : {(p n } is orthonormal on [0, 1], hence (*) is the Fourier series of some / e L 2 . If 
a =j • 2 ~ n , b — (/ + 1) • 2 ~ N , a <t < b, and s N = + • • • + c^^jthen, for n> N, 

1 1 C b 

S M = 7 s N dm = 7 s n dm, 

b-aj a b - a J a 

and the last integral converges to / dm, as n— > oo. Show that (*) converges to f{t) at almost every 
Lebesgue point of /. 

19 Suppose /is continuous on R 1 ,f(x) > 0 if 0 < x < 1 ,f{x) — 0 otherwise. Define 

h c (x) — sup { n c f{nx ): n — 1, 2, 3, ...}. 


Prove that 

(a) h c is in 1}(R X ) if 0 < c < 1, 

(b) h x is in weak L 1 but not in 1}{R X ), 

(c) h c is not in weak L 1 if c > 1. 

20 (a) For any set E c R 2 , the boundary dE of E is, by definition, the closure of E minus the interior 
of E. Show that E is Lebesgue measurable whenever m{dE) = 0. 

(h) Suppose that E is the union of a ( possibly uncountable ) collection of closed discs in R 2 whose 
radii are at least 1. Use (a) to show that E is Lebesgue measurable. 

(c) Show that the conclusion of ( b ) is true even when the radii are unrestricted. 

(d) Show that some unions of closed discs of radius 1 are not Borel sets. (See Sec. 2.21.) 

( e ) Can discs be replaced by triangles, rectangles, arbitrary polygons, etc., in all this? What is 
the relevant geometric property? 
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21 If/ is a real function on [0, 1] and 


y(t) = t + i/(0, 

the length of the graph of/ is, by definition, the total variation of y on [0, 1]. Show that this length is 
finite if and only if/e BV. (See Exercise 13.) Show that it is equal to 

JVl + [/'(t )] 2 dt 

if / is absolutely continuous. 

22 (a) Assume that both / and its maximal function Mf are in l}(R k ). Prove that then fix) = 0 a.e. 
[m]. Hint : To every other / e l}(R k ) corresponds a constant c = c(f) > 0 such that 

(M/Xx) ^c|x|- k 

whenever | x | is sufficiently large. 

(b) Define f(x) = x~ ^log x)~ 2 if 0 < x < i,/(x) = 0 on the rest of JR 1 . Then / e L^R 1 ). Show that 
(M/Xx) ^ 1 2x log (2x) | - 1 (0 < x < 1/4) 

so that (M/Xx) dx — co. 

23 The definition of Lebesgue points, as made in Sec. 7.6, applies to individual integrable functions, 
not to the equivalence classes discussed in Sec. 3.10. However, if F e l}(R k ) is one of these equivalence 
classes, one may call a point x 6 R k a Lebesgue point of F if there is a complex number, let us call it 
(SFXx), such that 

lim/- f I/— (SF)(x)| dm = 0 

r—O m ( B r) JsU.r, 

for one (hence for every) / e F. 

Define ( SF)(x ) to be 0 at those points x e R k that are not Lebesgue points of F. 

Prove the following statement: If/e F, and x is a Lebesgue point of f then x is also a Lebesgue 
point of F, and f{x) — (SFXx). Hence SF e F. 

Thus S “ selects ” a member of F that has a maximal set of Lebesgue points. 



CHAPTER 

EIGHT 

INTEGRATION ON PRODUCT SPACES 


This chapter is devoted to the proof and discussion of the theorem of Fubini 
concerning integration of functions of two variables. We first present the theorem 
in its abstract form. 

Measurability on Cartesian Products 

8.1 Definitions If X and Y are two sets, their cartesian product X x Y is the 
set of all ordered pairs (x, y\ with x e X and y e Y. If A c= X and B a Y, it 
follows that A x B a X x Y. We call any set of the form A x B a rectangle 
in X x Y. 

Suppose now that (X, and (7, &~) are measurable spaces. Recall that 
this simply means that is a a-algebra in X and is a cr-algebra in Y. 

A measurable rectangle is any set of the form A x B, where A e ¥ and 
Be«f. 

If Q = R, u • • • u R n , where each R t is a measurable rectangle and R t n 
Rj — 0 for i ^ j, we say that Qe the class of all elementary sets. 

y x is defined to be the smallest a-algebra in X x Y which contains 
every measurable rectangle. 

A monotone class SOI is a collection of sets with the following properties : 
If A t e SOI, B t e SOI, A { c= A i + 19 B { id B i+l9 for i = 1, 2, 3, ... , and if 


A={jA t , B=f)B i , 


( 1 ) 


then A e SOI and B eWl. 


160 
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If E a X x Y, x e X, y e Y, we define 

E x = {y: (. x , y) e £}, £* = {x: (x, y) e E}. (2) 

We call E x and the x-section and y-section, , respectively, of £. Note that 
E x c y, E y a X. 


8.2 Theorem If E e Sf x then E x e ZT and E y e <9^, for every x e X and 
ye Y. 

Proof Let Q be the class of all E e x ZT such that E x e ZT for every 
xeXAIE — Ax B, then E x = B if x e A, E x = 0 if x £ A. Therefore every 
measurable rectangle belongs to Q. Since ZT is a <x-algebra, the following 
three statements are true. They prove that Q is a <x-algebra and hence that 

Q = ¥ x 3T \ 


(a) X x Y efi. 

(b) If E e Q, then ( E c ) x = (£ x ) c , hence E c e Q. 

(c) If Ei e Q (i = 1, 2, 3, . . .) and E = (J E { , then E x = (J , hence E e Q. 

The proof is the same for E y . //// 


8.3 Theorem x ZT is the smallest monotone class which contains all elemen- 
tary sets. 

Proof Let be the smallest monotone class which contains S\ the proof 
that this class exists is exactly like that of Theorem 1.10. Since x 3T is a 
monotone class, we have c= Sf x 2T . 

The identities 

(A^ x Bf) n (A 2 x B 2 ) = 04i n A 2 ) x (*i n B 2 ), 

C^i x B x ) — (A 2 x B 2 ) = \_(A l — A 2 ) x Bf\ u [(^4i n A 2 ) x (B l — B 2 f\ 

show that the intersection of two measurable rectangles is a measurable rec- 
tangle and that their difference is the union of two disjoint measurable rec- 
tangles, hence is an elementary set. \i P e S and Q e S’, it follows easily that 
P n Q e $ and P — Q e S. Since 

PvQ = (P-Q)kjQ 

and (P — Q) n Q = 0, we also have P u Q e S. 

For any set P c= X x 7, define Q(P) to be the class of all Q a X x Y such 
that P — QeWl,Q — PeWl, and P u Q e SDL The following properties are 
obvious: 

(a) Q e Q(P) if and only if P e Q(Q). 

(b) Since SUl is a monotone class, so is each Q(P). 
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Fix P e S. Our preceding remarks about $ show that Q e Q(P) for all 
Q e $, hence $ <= Q(P), and now (b) implies that 901 <= Q(P). 

Next, fix Q e 901. We just saw that Q e Q(P) if P e S. By (a), P e Q(®, 
hence $ <= Q(0, and if we refer to (b) once more we obtain 901 <= Q(0. 
Summing up: IfP and Q e 901, then P — Q e 901 and P u Q e 901. 

It now follows that 901 is a a-algebra in X x Y : 

(i ) X xY eS. Hence X x Y e 901. 

(ii) If Q e 901, then Q c e 901, since the difference of any two members of 901 is 
in 901. 

(iii) If P { e 901 for i — 1, 2, 3, ... , and P = (J P t , put 

Q n = Pi u ■■■ kj P„. 

Since 901 is closed under the formation of finite unions, Q n e 901. 

Since Q n a Q n + l and P = (J Q n , the monotonicity of 901 shows that 
PeWl. 

Thus 901 is a cr-algebra, $ a 901 <= Sf x 2T, and (by definition) x 2T is 
the smallest a-algebra which contains S. Hence 901 = ^ x //// 

8.4 Definition With each function / on X x Y and with each x e X we 
associate a function f x defined on Y by f x (y) = /(x, y). 

Similarly, if y e Y,f y is the function defined on X by f y (x) = /(x, y). 

Since we are now dealing with three a-algebras, £f, 3T , and Sf x ZT, we 
shall, for the sake of clarity, indicate in the sequel to which of these three 
a-algebras the word “ measurable ” refers. 


8.5 Theorem Let f be an (ff x &~)-measur able function on X x Y. Then 

(a) For each x e X,f x is a 3T - measurable function. 

(b) For each ye Y,f y is an -measurable function. 

Proof For any open set V, put 

Q = {(*, y):f(x, y) e V}. 

Then Q e x and 

Q x = {y.fjy) e v}. 


Theorem 8.2 shows that Q x e This proves (a); the proof of (b) is 
similar. //// 
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Product Measures 


8.6 Theorem Let (X, 9, p) and (7, ,T, /.) be a-finite measure spaces. Suppose 
Q e 9 x ST. If 

(p(x) = HQ X ), My) = p(Q y ) (i) 

for every x e X and y e 7, then cp is -measurable, ifr is 2T - measurable , and 



(2) 


Notes : The assumptions on the measure spaces are, more explicitly, that \i and X 
are positive measures on Sf and 3T , respectively, that X is the union of countably 
many disjoint sets X n with fi(X n ) < oo, and that Y is the union of countably 
many disjoint sets Y m with X(Y m ) < oo. 

Theorem 8.2 shows that the definitions (1) make sense. Since 

HQx) = J \q(x, y) dA(y) (x e X), (3) 

with a similar statement for ii(Q y ) 9 the conclusion (2) can be written in the form 

dp (x) f Xq(x, y) dA{y) = f dX{y) f * e (x, y) dp(x). (4) 

JX JY JY JX 

Proof Let Q be the class of all Q e ¥ x 2T for which the conclusion of the 
theorem holds. We claim that Q has the following four properties : 

(a) Every measurable rectangle belongs to Q. 

(b) If Qi Q 2 c Q 3 a • •• , if each Q t e Q, and if Q = (J Q t , then Q e Q. 

(c) If {Qi} is a disjoint countable collection of members of Q, and if 

Q = U Ci.thenQea. 

(d) If fi(A) < oo and X(B) < oo, if A x B id Q x id Q 2 => Q 3 => • • • , if Q = H Qi 
and Qi e Q for i = 1, 2, 3, ... , then Q e Q. 

If Q = A x B, where A e Sf, B e ZT , then 

KQx) = *(B)Xa(x) and fi(Q y ) = fi(A)x B {yl (5) 

and therefore each of the integrals in (2) is equal to fj,(A)X(B). This gives (a). 

To prove (b), let cp t and be associated with Q { in the way in which (1) 
associates cp and i/j with Q. The countable additivity of \i and X shows that 

<pi(x)^ <p(x), •J'iiy)^ My) (i-Kx>), (6) 

the convergence being monotone increasing at every point. Since cp { and 
are assumed to satisfy the conclusion of the theorem, ( b ) follows from the 
monotone convergence theorem. 
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For finite unions of disjoint sets, (c) is clear, because the characteristic 
function of a union of disjoint sets is the sum of their characteristic functions. 
The general case of (c) now follows from ( b ). 

The proof of (< d ) is like that of (b), except that we use the dominated 
convergence theorem in place of the monotone convergence theorem. This is 
legitimate, since p(A) < oo and X(B) < oo. 

Now define 

Q mn = Qn(X n x 7J (m,n = 1,2,3,...) (7) 

and let $01 be the class of all Q e ¥ x F such that Q mn e Q for all choices of 
m and n . Then (b) and (d) show that $01 is a monotone class; (a) and (c) show 
that & $01; and since $01 <= X 2T, Theorem 8.3 implies that $01 = Sf x 2T . 

Thus Q mn e Q for every Q e 9* x 2T and for all choices of m and n. Since 
Q is the union of the sets Q mn and since these sets are disjoint, we conclude 
from (c) that Q e Q. This completes the proof. //// 

8.7 Definition If ( X , Sf, p) and (7, 2T, X) are as in Theorem 8.6, and if 
g e y x «f , we define 

in x m) = | m x ) dn(x) = jW) dm (i) 

The equality of the integrals in (1) is the content of Theorem 8.6. We call 
p x X the product of the measures p and X. That p x X is really a measure 
(i.e., that p x X is countably additive on x F) follows immediately from 
Theorem 1.27. 

Observe also that p x X is a-finite. 

The Fubini Theorem 

8.8 Theorem Let ( X , p) and (7, 3T , X) be o-finite measure spaces , and let f 

be an x &~)- measurable function on X x 7. 

(a) I/O </< oo, and i/ 

<p(x) = J/x A •/'OO = j f y dfi (x s X, y e 7), (1) 

then q> is measurable , if/ is ZT - measurable , and 

L^=f (2) 

Jx Jx X y JY 

(b) Iff is complex and if 

(p*(x) = J | f\ x dX and J (p* dp < oo, (3) 


then f g I}(p x X). 
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(c) Iff g I}(fi x 2), then f x e I}(X) for almost all x g X,f y g l}(p) for almost all 
y g Y ; the functions <p and ij/, defined by (1) a.e ., are in l}(p) and I}(X), 
respectively , and (2) holds. 


Notes : The first and last integrals in (2) can also be written in the more usual 
form 


f Mx) j* f{x, y) dA(y) = f dA(y) f f(x, y) dfi(x). (4) 

%)X JY JY %)X 

These are the so-called “ iterated integrals ” of /. The middle integral in (2) is often 
referred to as a double integral 

The combination of ( b ) and (c) gives the following useful result: If f is 
iff x 2T)- measurable and if 


^Mx)^\f(x, y) | dA(y) < co, (5) 

then the two iterated integrals (4) are finite and equal 

In other words, “the order of integration may be reversed” for x !T)- 
measurable functions / whenever / > 0 and also whenever one of the iterated 
integrals of | / 1 is finite. 

Proof We first consider (a). By Theorem 8.5, the definitions of (p and ^ 
make sense. Suppose Qe^xf and / = Xq • By Definition 8.7, (2) is then 
exactly the conclusion of Theorem 8.6. Hence (a) holds for all nonnegative 
simple ( Sf x ^-measurable functions s. In the general case, there is a 
sequence of such functions s„, such that 0 < s x < s 2 < * * * and s n (x , y)— ► 
/ (x, y) at every point of X x Y. If <p n is associated with s n in the same way in 
which <p was associated to f we have 

I <P n dn = I s n d(n x A) (n = 1, 2, 3, . . .)• (6) 

JX JX x Y 

The monotone convergence theorem, applied on (7, 2), shows that (p n {x) 

increases to <p(x ), for every xel, as n— ► oo. Hence the monotone con- 
vergence theorem applies again, to the two integrals in (6), and the first 
equality (2) is obtained. The second half of (2) follows by interchanging the 
roles of x and y. This completes (a). 

If we apply (a) to | / 1 , we see that (b) is true. 

Obviously, it is enough to prove (c) for real l}(p x X)\ the complex case 
then follows. If /is real, (a) applies to f + and to/ - . Let q>i and cp 2 corre- 
spond to / + and /“ as q> corresponds to / in (1). Since f g I}(p x X) and 
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f + <|/|, and since (a) holds for / + , we see that cp x e l}(g). Similarly, cp 2 e 
Since 

/* = (/ + )* -(/“)* (7) 

we have / x e L 1 ^) for every x for which cpi(x) < oo and cp 2 (x ) < oo; since <p x 
and cp 2 are in this happens for almost all x ; and at any such x, we have 
(p(x) = ^i(x) — cp 2 (x). Hence cp e Now (2) holds with <p x and / + and 
with <jp 2 and /“, in place of cp and/; if we subtract the resulting equations, we 
obtain one half of (c). The other half is proved in the same manner, with f y 
and ^ in place of f x and cp. //// 


8.9 Counterexamples The following three examples will show that the various 
hypotheses in Theorems 8.6 and 8.8 cannot be dispensed with. 


(a) Let X = Y = [0, 1], \i = X = Lebesgue measure on [0, 1], Choose {<5„} so 
that 0 = < b 2 < d 3 < • • • , ► 1, and let g n be a real continuous function 

with support in (<5 W , <5 W + 1 ), such that g n (t) dt = 1, for n = 1, 2, 3, — Define 

OO 

fix, y) = £ [ 0 „(x) - 0 I , + i(*)M.y)- 

n= 1 

Note that at each point (x, y) at most one term in this sum is different from 0. 
Thus no convergence problem arises in the definition of /. An easy computa- 
tion shows that 



/(x, y)dy= 1^0 = 



/(x, y) dx, 


so that the conclusion of the Fubini theorem fails, although both iterated 
integrals exist. Note that / is continuous in this example, except at the point 
(1, 1), but that 


Jo 


dx 


n 

l/(*»>OI <iy = °°- 

Jo 


{b) Let X = Y = [0, 1], \i — Lebesgue measure on [0, 1], X — counting measure 
on 7, and put/(x, y) = 1 if x = y,/(x, y) = 0 if x # y. Then 

jV(x, y) dfi(x) = 0, J fix, _y) dX(y) = 1 

for all x and y in [0, 1], so that 

I* dX(y) f fix, y) dn(x) = 0*1 = f dfi(x) f f(x, >>) dX(y). 

%/y jx jx jy 

This time the failure is due to the fact that X is not <7-finite. 

Observe that our function / is (Sf x ^-measurable, if is the class of 
all Lebesgue measurable sets in [0, 1] and 3T consists of all subsets of [0, 1]. 
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To see this, note that f = Xd> where D is the diagonal of the unit square. 
Given n, put 



and put 


Q n = (h x 1 1 ) u (1 2 x 1 2 ) u • • • u (/„ x /„). 

Then Q n is a finite union of measurable rectangles, and D = P) Q n . 

(c) In examples (a) and (b), the failure of the Fubini theorem was due to the fact 
that either the function or the space was “ too big.” We now turn to the role 
played by the requirement that / be measurable with respect to the c-algebra 

y x/. 

To pose the question more precisely, suppose p(X) = A(7)=1,0</<1 
(so that “bigness” is certainly avoided); assume f x is ^"-measurable and f y is 
^-measurable, for all x and y ; and assume <p is ^-measurable and ^ is 
^"-measurable, where cp and \j/ are defined as in 8.8(1). Then 0 < cp < 1, 
0 < i// < 1, and both iterated integrals are finite. (Note that no reference to 
product measures is needed to define iterated integrals.) Does it follow that 
the two iterated integrals of / are equal ? 

The (perhaps surprising) answer is no. 

In the following example (due to Sierpinski), we take 

(X, fi) = (7, X) = [0, 1] 


with Lebesgue measure. The construction depends on the continuum hypoth- 
esis. It is a consequence of this hypothesis that there is a one-to-one mapping 
j of the unit interval [0, 1] onto a well-ordered set W such that j(x) has at 
most countably many predecessors in W, for each x e [0, 1]. Taking this for 
granted, let Q be the set of all (x, y ) in the unit square such that fix) precedes 
fiy) in W. For each x e [0, 1], Q x contains all but countably many points of 
[0, 1]; for each y e [0, 1], Q y contains at most countably many points of 
[0, 1]. If / = X Q , it follows that f x and f y are Borel measurable and that 

<p{x) = f{x, y) dy = 1, ipiy) = 


: 


f{x, y) dx = 0 


for all x and y. Hence 



fix, y) dy = 1 # 0 = 



fix, y) dx. 


Completion of Product Measures 

8.10 If (A', fi) and (7, X) are complete measure spaces, it need not be true 
that (I x 7, ^ x / , x / 1) is complete. There is nothing pathological about 
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this phenomenon: Suppose that there exists an A e A ^ 0, with p(A) — 0; 

and suppose that there exists a B Y so that B $ 3T. Then A x B A x 7, 
(p x X)(A x 7) = 0, but A x B $ 9 x 0T . (The last assertion follows from 
Theorem 8.2.) 

For instance, if p = 2 = (Lebesgue measure on R 1 ), let A consist of any 
one point, and let B be any nonmeasurable set in R 1 . Thus x is not a 
complete measure; in particular, x is not m 2 , since the latter is complete, 
by its construction. However, m 2 is the completion of x This result gener- 
alizes to arbitrary dimensions : 

8.11 Theorem Let m k denote Lebesgue measure on R k . If k = r + s, r > 1, 
s > 1, then m k is the completion of the product measure m r x m s . 

Proof Let 08 k and ffll k be the a-algebras of all Borel sets and of all Lebesgue 
measurable sets in R k , respectively. We shall first show that 

^ k ^m r xm s ^m k . (1) 

Every /c-cell belongs to 5R r x 2R S . The tr-algebra generated by the /c-cells is 
08 k . Hence 08 k c 9W r x . Next, suppose E e 2R r and Fef s . It is easy to 
see, by Theorem 2.20(b), that both E x R s and R r x F belong to The 
same is true of their intersection E x F. It follows that x Wl s <= . 

Choose Q e SK r x $R S . Then QeWl k , so there are sets and P 2 e 08 k 
such that P i czQ<=P 2 and m k (P 2 — P x ) = 0. Both m k and m r x m s are trans- 
lation invariant Borel measures on R k . They assign the same value to each 
/c-cell. Hence they agree on 08 k , by Theorem 2.20(6?). In particular, 


K x ™ S )(Q ~ Pi) ^ K x m s )(P 2 - P t ) = m k (P 2 - P l ) = 0 
and therefore 


(m r x m s )(Q) = (m r x m^PJ = m k (Pf) = m k (Q). 

So m r x m s agrees with m k on x 

It now follows that 901* is the ( m r x m s )-completion of 50l r x 50l s , and this 
is what the theorem asserts. //// 

We conclude this section with an alternative statement of Fubini’s theorem 
which is of special interest in view of Theorem 8.11. 

8.12 Theorem Let ( X , Sf 9 p) and (7, 0T, 2) be complete o-finite measure spaces. 
Let (9 x 0T)* be the completion of 9 x , relative to the measure p x L Let 
f be an (9 x 3T)* -measurable function on X x 7. Then all conclusions of 
Theorem 8.8 hold , the only difference being as follows: 

The 3T -measurability of f x can be asserted only for almost all xel, 
so that q>(x) is only defined a.e. [//] by 8.8(1); a similar statement holds for f y 
and i//. 
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The proof depends on the following two lemmas : 

Lemma 1 Suppose v is a positive measure on a a-algebra 901, 901* is the com- 
pletion of 901 relative to v, and f is an 901 * -measurable function. Then there 
exists an 901 - measurable function g such that f = g a.e. [i;]. 

(An interesting special case of this arises when v is Lebesgue measure on 
R k and 901 is the class of all Borel sets in R k .) 

Lemma 2 Let h be an (9 x ZT)* -measurable function on X x Y such that 
h = 0 a.e. [/i x 2]. Then for almost all x g X it is true that h(x , y) = 0 for 
almost all y g Y; in particular , h x is -measurable for almost all x g X. A 
similar statement holds for h y . 

If we assume the lemmas, the proof of the theorem is immediate: If /is as in 
the theorem, Lemma 1 (with v = p x X) shows that f = g - \- h, where h = 0 a.e. 
\_p x X] and g is (9 x ^-measurable. Theorem 8.8 applies to g. Lemma 2 shows 
that f x = g x a.e. [2] for almost all x and that f y = g y a.e. [^] for almost all y. 
Hence the two iterated integrals of f as well as the double integral, are the same 
as those of g , and the theorem follows. 

Proof of Lemma 1 Suppose / is 90l*-measurable and / > 0. There exist 901*- 
measurable simple functions 0 = s 0 < < s 2 < • • • such that s„(x)— ► f(x) for 

each x g X, as n— ► oo. Hence /= £ (s w + 1 — s n ). Since s n + 1 — s n is a finite 
linear combination of characteristic functions, it follows that there are con- 
stants c t > 0 and sets E t e 901* such that 

/(*)= Z c i *£,(*) (*eX). 

1=1 

The definition of 901* (see Theorem 1.36) shows that there are sets A t e 901, 
B t g 901, such that A t a E t <= B t and v(B t — A t ) — 0. Define 

OO 

g(x)= lc iXA ,(x) (xeX). 

i = 1 

Then the function g is 901-measurable, and g(x) = / (x), except possibly when 
x g (J ( E t — A t ) cz (J (B t — Ai). Since v(B t — A t ) = 0 for each i, we conclude 
that g =f a.e. [i;]. The general case (/ real or complex) follows from this. //// 

Proof of Lemma 2 Let P be the set of all points in X x Y at which 
h(x , y) t * 0. Then P g (9 x 3Ty and (p x X)(P) = 0. Hence there exists a 
2e^x«f such that P <= Q and (p x X)(Q) = 0. By Theorem 8.6, 

j/e*) dfi(x) = 0. 


(i) 
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Let N be the set of all x e X at which A(Q X ) > 0. It follows from (1) that 
n(N) = 0. For every x N, A(Q X ) = 0. Since P x a Q x and (7, X) is a com- 
plete measure space, every subset of P x belongs to 2T if x £ N. If y $ P x , then 
h x (y ) = 0. Thus we see, for every x $ N, that h x is ^“-measurable and that 
h x (y) = 0 a.e. [*]. //// 


Convolutions 

8.13 It happens occasionally that one can prove that a certain set is not empty 
by proving that it is actually large. The word “large” may of course refer to 
various properties. One of these (a rather crude one) is cardinality. An example is 
furnished by the familiar proof that there exist transcendental numbers: There 
are only countably many algebraic numbers but uncountably many real 
numbers, hence the set of transcendental real numbers is not empty. Applications 
of Baire’s theorem are based on a topological notion of largeness : The dense G/s 
are “large” subsets of a complete metric space. A third type of largeness is 
measure-theoretic: One can try to show that a certain set in a measure space is 
not empty by showing that it has positive measure or, better still, by showing 
that its complement has measure zero. Fubini’s theorem often occurs in this type 
of argument. 

For example, let / and g e l}( R 1 ), assume /> 0 and g > 0 for the moment, 
and consider the integral 

f 00 

h(x) = f(x — t)g(t) dt ( — oo < x < oo). (1) 

J- 00 

For any fixed x, the integrand in (1) is a measurable function with range in 
[0, oo], so that h(x ) is certainly well defined by (1), and 0 < h(x) < oo. 

But is there any x for which h(x) < oo? Note that the integrand in (1) is, for 
each fixed x, the product of two members of L 1 , and such a product is not always 
in L 1 . [Example: /(x) = g(x) = l/^/x if 0 < x < 1, 0 otherwise.] The Fubini 
theorem will give an affirmative answer. In fact, it will show that h e lliR 1 ), 
hence that h(x) < oo a.e. 


8.14 Theorem Suppose f e IfiR 1 ), g e L*( R x ). Then 



\f{x-y)g(y)\ dy < oo 


for almost all x. For these x, define 


Then h e L 1 ^ 1 ), and 


h(x) = 



/(* - y)g(y) dy. 


(1) 

( 2 ) 




( 3 ) 
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where 


ll/lli 



I /Ml dx. 


( 4 ) 


We call h the convolution of / and g , and write h=f * g. 


Proof There exist Borel functions f 0 and g 0 such that f 0 = / a.e. and g 0 = g 
a.e. The integrals (1) and (2) are unchanged, for every x, if we replace /by f 0 
and by ^r 0 . Hence we may assume, to begin with, that / and g are Borel 
functions. 

To apply Fubini’s theorem, we shall first prove that the function F 
defined by 


F(x 9 y)=f(x- y)g(y) (5) 

is a Borel function on R 2 . 

Define cp \ F 2 — > R l and ij/: F 2 — ► R l by 

<?M y) = x-y 9 iAM y) = y- (6) 

Then f(x — y) = (/© ^)(x, y) and ^f(y) = (g o ^)(x, y). Since <p and i// are Borel 
functions, Theorem 1.12(d) shows that f o cp and g ° ij/ are Borel functions on 
K 2 . Hence so is their product. 

Next we observe that 


dy 


| F(x, y) | dx 


I 


\g(y)\dy |/(x - y)| dx = WfhWgWu (7) 


since 


r 


|/(x - y)| dx = ll/lli 


( 8 ) 


for every y e F 1 , by the translation-invariance of Lebesgue measure. 

Thus F e 1}(R 2 \ and Fubini’s theorem implies that the integral (2) exists 
for almost all x e R l and that h e l}( R 1 ). Finally, 


\\hh = 


| h(x) | dx 


Coo a 

L4 


Coo f* o 

*L dx l 


I F(x,y)\dy 


\F(x,y)\dx=\\f\\ 1 \\g\\ 1 , 


by (7). This gives (3), and completes the proof. 


Ill / 


Convolutions will play an important role in Chap. 9. 
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Distribution Functions 

8.15 Definition Let \i be a a-finite positive measure on some a-algebra in 
some set X. Let /: X—> [0, oo] be measurable. The function that assigns to 
each t g [0, oo) the number 

l^{f> t} = n({xe X:f(x)>t}) (1) 

is called the distribution function of / It is clearly a monotonic (nonincreasing) 
function of t and is therefore Borel measurable. 

One reason for introducing distribution functions is that they make it 
possible to replace integrals over X by integrals over [0, oo); the formula 

f fdn = %{/> t } it (2) 

JX Jo 

is the special case cp(t) = t of our next theorem. This will then be used to 
derive an L p -property of the maximal functions that were introduced in Chap. 
7. 


8.16 Theorem Suppose that f and p are as above , that cp\ [0, oo]-> [0, oo] is 
monotonic , absolutely continuous on [0, T]/or every T < oo, and that cp( 0) = 0 
and cp(t)-+ cp(oo) as t — » oo. Then 

\((pof)dfi=\ n{f > t}(p'(t) dt. (1) 

JX Jo 

Proof Let E be the set of all (x, t) e X x [0, oo) where f(x) > t. When / is 
simple, then E is a union of finitely many measurable rectangles, and is there- 
fore measurable. In the general case, the measurability of E follows via the 
standard approximation of /by simple functions (Theorem 1.17). As in Sec. 
8.1, put 


J E* = {x g X: (x, t) g Ej (0 < t < oo ). 
The distribution function of/ is then 


p(E { ) 


J x E {x , t) dp(x). 


The right side of (1) is therefore 



MWW dt 



Xe(x , t)(p'(t) dt , 


( 2 ) 

( 3 ) 


( 4 ) 


by Fubini’s theorem. 
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For each xel, * £ (x, 0=1 if t <f( x ) and is 0 if t > f(x). The inner 
integral in (4) is therefore 



(p'(t) dt = <p(f(xj) 


(5) 


by Theorem 7.20. Now (1) follows from (4) and (5). 


//// 


8.17 Recall now that the maximal function Mf lies in weak L 1 when / e I}(R k ) 
(Theorem 7.4). We also have the trivial estimate 

\\Mf\\„< || /|L (1) 

valid for all / e L°°(R fc ). A technique invented by Marcinkiewicz makes it possible 
to “ interpolate ” between these two extremes and to prove the following theorem 
of Hardy and Littlewood (which fails when p = 1 ; see Exercise 22, Chap. 7). 

8.18 Theorem 7/1 < p < oo and f e IF(R k ) then Mf e IF(R k ). 


Proof Since M/= M(|/|) we may assume, without loss of generality, that 
/ > 0. Theorem 7.4 shows that there is a constant A , depending only on the 
dimension k 9 such that 


m{Mg > t} <— H^llx (1) 

for every g e I}(R k ). Here, and in the rest of this proof, m — m k , the Lebesgue 
measure on R k . 

Pick a constant c, 0 < c < 1, which will be specified later so as to mini- 
mize a certain upper bound. For each t e (0, oo), split / into a sum 


/= Gt + K 


(2) 


where 


= 


f(x) if f(x)>ct 

0 if f(x) < ct. 

Then 0 < h t (x) < ct for every x e R k . Hence h t e L 00 , Mh t < ct 9 and 

Mf < Mg t + Mh t < Mg t + ct. 

If (M/)(x) > t for some x, it follows that 

(Mg t )(x) > (1 - c)t. 

Setting E t = {/> ct} 9 (5), (1), and (3) imply that 

A 


m{Mf > t} < m{Mg t > (1 — c)t} < 


(1 - c)t 




f dm. 


(3) 

(4) 

(5) 
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We now use Theorem 8.16, with X = R k , \i = m, <p(t) = t p , to calculate 


(Mf) p dm — p 


m{Mf > t]t p 1 dr < - — — J 2 Jr J / 

1 — C JO Je, 




Ape 


i -p 


(1 - cXp - 1) 


L 

jRk 


/ p dm. 


This proves the theorem. However, to get a good constant, let us choose c so 
as to minimize that last expression. This happens when c = (p — 1 )/p = 1/g, 
where g is the exponent conjugate to p. For this c, 


1 + 


1 V" 1 


P-1 


< e. 


and the preceding computation yields 

||M/|| p < C p \\f\\ p 

where C p = ( Aepq ) l/P . 


( 6 ) 

//// 


Note that C p -> 1 as p-> oo, which agrees with formula 8.17(1), and that 
C p — ► oo as p— > 1. 


Exercises 

1 Find a monotone class 501 in R l which is not a c-algebra, even though R l eWl and R l — A e 501 for 
every A e 5R. 

2 Suppose /is a Lebesgue measurable nonnegative real function on R l and A(f) is the ordinate set of 
/ This is the set of all points (x, y) e R 2 for which 0 < y < f(x). 

(a) Is it true that A(f) is Lebesgue measurable, in the two-dimensional sense? 

(b) If the answer to (a) is affirmative, is the integral of / over R 1 equal to the measure of A(f)7 

( c ) Is the graph of/a measurable subset of R 2 ? 

( d ) If the answer to (c) is affirmative, is the measure of the graph equal to zero? 

3 Find an example of a positive continuous function / in the open unit square in R 2 , whose integral 
(relative to Lebesgue measure) is finite but such that q>{x) (in the notation of Theorem 8.8) is infinite 
for some x e (0, 1). 

4 Suppose 1 < p < oo,/ e L}(R l ), and g e l?(R l ). 

(a) Imitate the proof of Theorem 8.14 to show that the integral defining (/ * exists for 
almost all x, that / * g e l?(R l ), and that 


II/* 0ll p < ll/llillflfllp. 

(b) Show that equality can hold in (a) if p = 1 and if p = oo, and find the conditions under which 
this happens. 

(c) Assume 1 < p < oo, and equality holds in (a). Show that then either / = 0 a.e. or g = 0 a.e. 

(d) Assume l<p<oo,e>0, and show that there exist f e l)(R l ) and g e IF(R l ) such that 


ll/*flf|lp>(l-€)||/|| 1 ||flf|| p . 
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5 Let M be the Banach space of all complex Borel measures on R 1 . The norm in M is ||/i|| = I^KR 1 ). 
Associate to each Borel set E <= R l the set 

E 2 = {(*> y): x + y e E} <= R 2 . 

If fi and X e M, define their convolution fi * X to be the set function given by 

* m) = 01 x X XE 2 ) 

for every Borel set E <= R 1 ; fi x X is as in Definition 8.7. 

(u) Prove that fi * X e M and that \\fi * X\\ < ||/i|| ||A||. 

(b) Prove that fi * X is the unique v e M such that 

jfdv = jj f(x + y) dn(x) dMy) 

for every / e C^R 1 ). (All integrals extend over R 1 .) 

(c) Prove that convolution in M is commutative, associative, and distributive with respect to 
addition. 

(d) Prove the formula 


(fi * xm = 


I 


me - 1) dm 


for every n and X e M and every Borel set E. Here 


E — t = {x — t: x e E}. 

(e) Define fi to be discrete if fi is concentrated on a countable set; define fi to be continuous if 
fi({x}) = 0 for every point x e R 1 ; let m be Lebesgue measure on R 1 (note that m i M). Prove that 
fi * X is discrete if both fi and X are discrete, that fi * X is continuous if fi is continuous and X e M, 
and that fi * X 4, m il fi <4, m. 

(/) Assume dfi—f dm, dX — g dm, f e I}(R l ), and g e L 1 ^ 1 ), and prove that 
d(fi * X) = (/ * g) dm. 

(g) Properties (a) and (c) show that the Banach space M is what one calls a commutative Banach 
algebra. Show that (e) and (/) imply that the set of all discrete measures in M is a subalgebra of M , 
that the continuous measures form an ideal in M, and that the absolutely continuous measures 
(relative to m) form an ideal in M which is isomorphic (as an algebra) to l}(R l ). 

(h) Show that M has a unit, i.e., show that there exists a S e M such that S * fi = fi for all 
fi e M. 

(i) Only two properties of R 1 have been used in this discussion: R 1 is a commutative group 
(under addition), and there exists a translation invariant Borel measure m on R 1 which is not identi- 
cally 0 and which is finite on all compact subsets of R 1 . Show that the same results hold if R 1 is 
replaced by R k or by T (the unit circle) or by T k (the /c-dimensional torus, the cartesian product of k 
copies of T), as soon as the definitions are properly formulated. 

6 (Polar coordinates in R k .) Let S k _ x be the unit sphere in R k , i.e., the set of all u e R k whose distance 
from the origin 0 is 1. Show that every x e R k , except for x = 0, has a unique representation of the 
form x = ru, where r is a positive real number and u e S k _ v Thus R k — {0} may be regarded as the 
cartesian product (0, 00 ) x S k - V 

Let m k be Lebesgue measure on R k , and define a measure o k _ t onS t _j as follows: If A cz S k _ x 
and A is a Borel set, let A be the set of all points ru, where 0 < r < 1 and u e A, and define 


cr k _ 1 (A) = /c • m k (A). 
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Prove that the formula 


f fdm k = r* 1 dr f f(ru) da k . x (u) 
jR k Jo Js k - 1 

is valid for every nonnegative Borel function / on R k . Check that this coincides with familiar results 
when k = 2 and when k = 3. 

Suggestion : If 0 < r 1 < r 2 and if A is an open subset of S k _ u let £ be the set of all ru with 
r t < r < r 2 , u e A, and verify that the formula holds for the characteristic function of E. Pass from 
these to characteristic functions of Borel sets in R k . 

7 Suppose ( X , Sf, n) and (7, X) are c-finite measure spaces, and suppose ^ is a measure on 
x P such that 


i )/(A x B) = fi{A)X{B) 

whenever A e Sf and BeJ. Prove that then \)/(E) — {fix X)(E) for every E e x P . 

8 {a) Suppose / is a real function on R 2 such that each section f x is Borel measurable and each section 
f y is continuous. Prove that /is Borel measurable on R 2 . Note the contrast between this and Example 
8.9(c). 

(b) Suppose g is a real function on R k which is continuous in each of the k variables separately. 
More explicitly, for every choice of x 2 , . . . , x k , the mapping x x — > g(x lt x 2 , . . . , x k ) is continuous, etc. 
Prove that g is a Borel function. 

Hint : If (i — l)/n = a i _ 1 < x < a { , = i/n, put 




a { — x 


a i ~ a i - 1 


/(fli-i. y) + 



/K-> y)- 


9 Suppose £ is a dense set in R 1 and / is a real function on R 2 such that [a)f x is Lebesgue measurable 
for each x e £ and ( b ) f y is continuous for almost all y e R l . Prove that /is Lebesgue measurable on 
R 2 . 

10 Suppose / is a real function on R 2 ,f x is Lebesgue measurable for each x, and f y is continuous for 
each y. Suppose g:R l ^R x is Lebesgue measurable, and put h{y) = f {g{y), y). Prove that h is 
Lebesgue measurable on R 1 . 

Hint : Define /„ as in Exercise 8, put h n {y ) = f n {g{y ), y\ show that each h n is measurable, and that 
h n (y)^h(y). 

11 Let 0& k be the <r-algebra of all Borel sets in R k . Prove that @ m+n = 38 m x @ n . This is relevant in 
Theorem 8.14. 

12 Use Fubini’s theorem and the relation 


1 

x 



dt 


(x > 0) 


to prove that 


lim 

■i4-* oo 



13. If fi is a complex measure on a c-algebra show that every set E e 9JI has a subset A for which 


Ia<04)|£-ImI(£). 

n 
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Suggestion : There is a measurable real function 6 so that d\i = e ie d | fi \ . Let A a be the subset of 
E where cos (0 — a) > 0, show that 

Re = Jcos + (0 - a) d | \i | , 

and integrate with respect to a (as in Lemma 6.3). 

Show, by an example, that \/n is the best constant in this inequality. 

14 Complete the following proof of Hardy’s inequality (Chap. 3, Exercise 14): Suppose / > 0 on 
(0, oo ),/ e If, 1 < p < oo, and 



r 


fit) dt . 


Write xF(x) = jj f(t)t a t a dt, where 0 < a < 1 /q, use Holder’s inequality to get an upper bound for 
F(x) p , and integrate to obtain 


f 


F p (x)dx<(\ -txq) i - p (ap)- 1 
Show that the best choice of a yields 


f"(t) dt. 




15 Put (p{t) = 1 — cos r if 0 < t < 2n, (p(t) = 0 for all other real t. For — oo < x < oo, define 


f(x) = 1, g(x) = cp\x). 


h(x) 


-l 


(p{t) dt. 


Verify the following statements about convolutions of these functions: 

(i) (/ * d)( x ) — 0 for all x. 

(ii) ig * h)ix) = iq> * (p){x) > 0 on (0, 4n). 

(iii) Therefore (/ * g) * h = 0, whereas f * ig * h) is a positive constant. 

But convolution is supposedly associative, by Fubini’s theorem (Exercise 5(c)). What went wrong? 
16 Prove the following analogue of Minkowski’s inequality, for/> 0: 



A*, y) dXiy) 


~ p i/p 
► <2/i(x)> 



f p (x, y) dfiix) 


Up 

dXiy). 


Supply the required hypotheses. (Many further developments of this theme may be found in [9].) 



CHAPTER 

NINE 

FOURIER TRANSFORMS 


Formal Properties 


9.1 Definitions In this chapter we shall depart from the previous notation 
and use the letter m not for Lebesgue measure on R 1 but for Lebesgue 
measure divided by yjln. This convention simplifies the appearance of 
results such as the inversion theorem and the Plancherel theorem. Accord- 
ingly, we shall use the notation 


f(x) dm{x) 


y/2n 


f(x) dx , 


where dx refers to ordinary Lebesgue measure, and we define 


\f(x)\ p dm(x) 


(1 < p < oo), 


(/ * 0)M = /(* - y)g(y) dm(y) (x e K 1 ), 


f(x)e lxt dm(x) ( t e K 1 ). 


Throughout this chapter, we shall write IF in place of IF(R l ), and C 0 will 
denote the space of all continuous functions on R l which vanish at infinity. 

If / g L 1 , the integral (4) is well defined for every real t. The function / is called 
the Fourier transform of / Note that the term “Fourier transform” is also applied 
to the mapping which takes / to / 


178 
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The formal properties which are listed in Theorem 9.2 depend intimately on 
the translation-invariance of m and on the fact that for each real a the mapping 
x— ► e i<tx is a character of the additive group R 1 . By definition, a function cp is a 
character of R 1 if | cp(t) \ = 1 and if 

<p(s + t) = (p(s)(p(t ) (5) 

for all real s and t. In other words, cp is to be a homomorphism of the additive 
group R 1 into the multiplicative group of the complex numbers of absolute value 
1. We shall see later (in the proof of Theorem 9.23) that every continuous charac- 
ter of R 1 is given by an exponential. 

9.2 Theorem Suppose f e L 1 , and a and X are real numbers. 

(a) If g(x) =f(x)e iax , then g(t) =f(t - a). 

(b) If g(x) =f(x - a), then g(t) = /(r)e" ia '. 

(c) If g e L l and h =f * g, then fi(t) =f(t)g(t). 

Thus the Fourier transform converts multiplication by a character into 
translation , and vice versa , and it converts convolutions to pointwise products. 

(d) If g(x) =/( — x), then g(t) =J{t). 

(e) If g(x) =f(x/X) and X > 0, then g(t) = Xf(Xt). 

(/) tf 9( x ) = —ixf(x) and g e L 1 , then fis differentiable and f\t) — g(t). 


Proof (a), ( b ), ( d ), and (e) are proved by direct substitution into formula 
9.1(4). The proof of (c) is an application of Fubini’s theorem (see Theorem 
8.14 for the required measurability proof): 


««=/_* 


-t 


e ltx dm(x) 


f(x - y)g(y) dm(y) 


g(y)e " y dm(y) f(x-y)e 


f 


-it(x-y) 


dm(x) 


g(y)e " y dm(y) 


f ( x)e itx dm(x) 


= )• 

Note how the translation-invariance of m was used. 
To prove (/), note that 


fjs) -/(f) 



me 


lxt (p(x , s — t) dm(x) 


(s 7^ t) 9 


s — t 


(1) 
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where cp(x , u) = ( e~ lxu — 1 )/u. Since | <p(x 9 u) \ < | x | for all real u ^ 0 and since 
<p(x 9 w)— ► —ix as u — > 0, the dominated convergence theorem applies to (1), if 
s tends to t 9 and we conclude that 


f'(t) = ~i 


* 00 

xf(x)e~ ixt dm(x). 

J — 00 


( 2 ) 

llll 


9.3 Remarks 

(a) In the preceding proof, the appeal to the dominated convergence 
theorem may seem to be illegitimate since the dominated convergence 
theorem deals only with countable sequences of functions. However, it 
does enable us to conclude that 

lim = _; f°° xf(x)e~ ixt dm(t) 

n~* oo S n t J - oo 

for every sequence {s„} which converges to r, and this says exactly that 

lim — ftf) __ _i f X f(x)e~ ixt dm(t). 
s->t s — t J- oo 

We shall encounter similar situations again, and shall apply con- 
vergence theorems to them without further comment. 

(b) Theorem 9.2(h) shows that the Fourier transform of 

[/(* + a) -f(x )-]/ a 


is 


This suggests that an analogue of Theorem 9.2(f) should be true under 
certain conditions, namely, that the Fourier transform of /' is itf (t). If 
/ e L 1 ,/' e L 1 , and if / is the indefinite integral of /', the result is easily 
established by an integration by parts. We leave this, and some related 
results, as exercises. The fact that the Fourier transform converts differ- 
entiation to multiplication by ti makes the Fourier transform a useful 
tool in the study of differential equations. 

The Inversion Theorem 

9.4 We have just seen that certain operations on functions correspond nicely to 
operations on their Fourier transforms. The usefulness and interest of this corre- 
spondence will of course be enhanced if there is a way of returning from the 
transforms to the functions, that is to say, if there is an inversion formula. 
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Let us see what such a formula might look like, by analogy with Fourier 
series. If 


c 


n 


2n 



dx 


then the inversion formula is 


(n e Z), 


(i) 


f(x) = X c n e inx . (2) 

— oo 

We know that (2) holds, in the sense of L 2 -convergence, if / e L 2 (T). We also know 
that (2) does not necessarily hold in the sense of pointwise convergence, even if / 
is continuous. Suppose now that / e I}(T), that { c n } is given by (1), and that 

£|c.|<00. (3) 


Put 


g(x)= Z c H e**. (4) 


By (3), the series in (4) converges uniformly (hence g is continuous), and the 
Fourier coefficients of g are easily computed : 



= (5) 

Thus / and g have the same Fourier coefficients. This implies /= g a.e., so the 
Fourier series of / converges to f(x) a.e. 

The analogous assumptions in the context of Fourier transforms are that 
/ e 1} and /e L 1 , and we might then expect that a formula like 

fix) = [ f(t)e Ux dm{t) (6) 

J— oo 

is valid. Certainly, if/e L 1 , the right side of (6) is well defined; call it g(x); but if 
we want to argue as in (5), we run into the integral 



gift “ s)x 


dx , 


(7) 


which is meaningless as it stands. Thus even under the strong assumption that 
/e L 1 , a proof of (6) (which is true) has to proceed over a more devious route. 
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[It should be mentioned that (6) may hold even if / £ L 1 , if the integral over 
(—00,00) is interpreted as the limit, as 4— >oo, of integrals over ( — A, A). 
(Analogue: a series may converge without converging absolutely.) We shall not 
go into this.] 

9.5 Theorem For any function f on R 1 and every y e R l , let f be the translate 
of f defined by 


ffx) =f(x -y) (xe R l ). (1) 

If 1 < p < oo and iff e IF , the mapping 

y * fy (2) 

is a uniformly continuous mapping of R l into IF(R l ). 

Proof Fix e > 0. Since / e IF , there exists a continuous function g whose 
support lies in a bounded interval [ — 4, A], such that 

II/- g\\ p < « 

(Theorem 3.14). The uniform continuity of g shows that there exists a 
3 e (0, A) such that | s — 1 1 <3 implies 

| g(s)-g{t)\<{?A)-^€. 

If | s — 1 1 < d, it follows that 

I ” | g(x - s) - g(x - t) \" dx < (3 A)~ V(2A + 5) < e', 

J— 00 

so that II g s - g,\\ p < e. 

Note that If-norms (relative to Lebesgue measure) are translation- 
invariant: \\f\\ p = ||/ s || p . Thus 

11/ -/lip ^ II/— 9s\\p + II 9s ~ 9t\\ P + II 9, -/lip 

= ll(/- 0)A, + 110. - 9,\\ P + II (9 ~f)t\\p < 3€ 

whenever | s — t \ < 6. This completes the proof. //// 

9.6 Theorem Iff e If, then f e C 0 and 

ll/lloo < ll/lll- (1) 

Proof The inequality (1) is obvious from 9.1(4). If ► t, then 
1/(0 -/( 01 < P I /Ml \e-“" x - e-“*\ dm(x). 


( 2 ) 
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The integrand is bounded by 2\f(x)\ and tends to 0 for every x, as n— ► oo. 
Hence /(*„)— ►/(O* by the dominated convergence theorem. Thus /is contin- 
uous. 

Since e ni = — 1, 9.1(4) gives 


/w = 


f(x)e 


- it(x + n/t) 


dm(x) 


-f 


f(x — n/t)e itx dm(x). 


Hence 


2/(0 = [°° |/(x) -/(x - dm(x). 


(3) 

(4) 


so that 


2|/(0I ^ \\f—fn/t 111* (5) 

which tends to 0 as t— * ± oo, by Theorem 9.5. //// 

9.7 A Pair of Auxiliary Functions In the proof of the inversion theorem it will be 
convenient to know a positive function H which has a positive Fourier transform 
whose integral is easily calculated. Among the many possibilities we choose one 
which is of interest in connection with harmonic functions in a half plane. (See 
Exercise 25, Chap. 11.) 

Put 


H(t) = e~ w 


( 1 ) 


and define 


h x (x) = 



H(At)e“ x dm(t) 


A simple computation gives 


(A > 0). 


M*) = 


12 X 
yj n X 2 + x 2 


( 2 ) 


(3) 


and hence 


r 


h k (x) dm(x) = 1. 


(4) 


Note also that 0 < H(t) < 1 and that ► 1 as X — ► 0. 

9.8 Proposition /// e L 1 , then 

(/* M*) = I*" H(h)f(ty*< dm(t). 

J— OO 
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Proof This is a simple application of Fubini’s theorem. 


if* M*) 


-r 


fix - y) dm(y) 


H{h)e i,y dm(t) 


■L 


H(At) dm(t) f (x — y)e lty dm(y) 


H(At) dm(t) f f(y)e i,(x y) dm(y) 

J — 00 J — 00 


= H(At) f ( t)e ltx dm(t). 

J “ 


9.9 Theorem IfgeH 0 and g is continuous at a point x, then 

lim [g * h x )(x) = g(x). 

A-+0 


an 

(i) 


Proof On account of 9.7(4), we have 


ig * h^x) - g(x) = 



y) - gix)HK(y) dm(y) 

)0 - dm(y) 

As) — gMJh^s) dm(s). 


The last integrand is dominated by 2\\g\\ o0 h 1 (s) and converges to 0 pointwise 
for every s, as A— >0. Hence (1) follows from the dominated convergence 
theorem. //// 


9.10 Theorem Ifl < p < oo and f e Zf, then 

lim ||/* h k —/||p = 0. (1) 


The cases p = 1 and p — 2 will be the ones of interest to us, but the general 
case is no harder to prove. 


Proof Since h x e 13 , where q is the exponent conjugate to p, (/ * h x )(x) is 
defined for every x. (In fact,/ * h k is continuous; see Exercise 8.) Because of 
9.7(4) we have 


if * hJix) -f{x) = 



[fix - y ) -/(xflMy) dm(y) 


(2) 
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and Theorem 3.3 gives 

I (/ * h x )(x) -f(x) r < | | f(x- y) -f(x) | p h x (y) dm(y). (3) 

J — OO 

Integrate (3) with respect to x and apply Fubini’s theorem: 


Wf*K-fV p < 


” If, -f\\ p P K(y) dm(y). 

— 00 


(4) 


If G(y ) = \\fy—f\\p, then g is bounded and continuous, by Theorem 9.5, and 
t/(0) = 0. Hence the right side of (4) tends to 0 as X — > 0, by Theorem 9.9. //// 


9.11 The Inversion Theorem /// e L 1 and f e I}, and if 


9(x) = 


f(t)e ixl dm(t) 

J - 00 


(x e R 1 ), 


( 1 ) 


then g e C 0 and f (x) = g(x) a.e. 


Proof By Proposition 9.8, 


(f*h x )(x) = 


% 00 

H(Xt) f(t)e ixt dm(t). 

J- 00 


( 2 ) 


The integrands on the right side of (2) are bounded by | f(t) \ , and since 
H(At)-+ 1 as 2— > 0, the right side of (2) converges to g(x), for every x e R 1 , by 
the dominated convergence theorem. 

If we combine Theorems 9.10 and 3.12, we see that there is a sequence 
{2„} such that ► 0 and 


lim (/* hj(x) =f(x) a.e. (3) 


Hence f(x) = g(x) a.e. That g e C 0 follows from Theorem 9.6. //// 


9.12 The Uniqueness Theorem IffeL } and f(t) = 0 for all t e R l , then 
f(x) = 0 a.e. 


Proof Since / = 0 we have / e L 1 , and the result follows from the inversion 
theorem. //// 

The Plancherel Theorem 

Since the Lebesgue measure of R 1 is infinite, L 2 is not a subset of L 1 , and the 
definition of the Fourier transform by formula 9.1(4) is therefore not directly 
applicable to every / e L 2 . The definition does apply, however, if / e L 1 n L 2 , and 
it turns out that then / e L 2 . In fact, ||/|| 2 = II /II 2 - This isometry of L 1 n L 2 into 
L 2 extends to an isometry of L 2 onto L 2 , and this extension defines the Fourier 
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transform (sometimes called the Plancherel transform) of every f e L 2 . The 
resulting L 2 -theory has in fact a great deal more symmetry than is the case in L 1 . 
In L 2 ,/ and / play exactly the same role. 


9.13 Theorem One can associate to each f e L 2 a function f e L 2 so that the 
following properties hold: 

(a) Iff e L 1 n L 2 , then f is the previously defined Fourier transform of f. 

(b) For every f e I?, \\f\\ 2 = \\f\\ 2 - 

(c) The mapping /— »/ is a Hilbert space isomorphism of L 2 onto I}. 

(i d ) The following symmetric relation exists between f and f : If 


(Pa (* ) = 
then || <p A — /|| 



/ ( x)e lxt dm(x) and i// A (x) = 
0 and \\\j/ A — /|| 2 — ► 0 as A—> oo. 


f(t)e lx ‘ dm(t), 

-A 


Note : Since L 1 n L 2 is dense in L 2 , properties (a) and ( b ) determine the mapping 
/— >/ uniquely. Property (d) may be called the L 2 inversion theorem. 


Proof Our first objective is the relation 

II fh = II/II 2 (/e L 1 n L 2 ). (1) 

We fix/ e L 1 n L 2 , put /' (.x) = /( — x), and define g =f *f. Then 

g(x)= j* f(x - y)f(-y) dm(y) = f f(x + y)f(y) dm(y), (2) 

J - 00 J — OO 


or 


g(x) = (f- x ,f), (3) 

where the inner product is taken in the Hilbert space L 2 and f_ x denotes a 
translate of f as in Theorem 9.5. By that theorem, x^>f_ x is a continuous 
mapping of R 1 into L 2 , and the continuity of the inner product (Theorem 4.6) 
therefore implies that g is a continuous function. The Schwarz inequality 
shows that 


|0(*)I< Il/-J 2 ll/Il2= ll/ll! 


so that g is bounded. Also, gel} since / e L 1 and / e L 1 . 
Since g e L 1 , we may apply Proposition 9.8: 


(g * MO) = 


*00 

H(h)g(t) dm(t). 

^ - 00 


Since g is continuous and bounded, Theorem 9.9 shows that 
lim (g* h x )(0) = 0 (O)= || /|| 2 . 

A-0 


( 4 ) 


( 5 ) 


(6) 
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Theorem 92(d) shows that g = | / 1 2 > 0, and since H(Xt) increases to 1 as 
X — ► 0, the monotone convergence theorem gives 


lim P 

A-0 J- 


H(Xt)g(t) dm(t) 


= f 00 1 /mi 2 

J- 00 


dm(t). 


(7) 


Now (5), (6), and (7) shows that / e L 2 and that (1) holds. 

This was the crux of the proof. 

Let Y be the space of all Fourier transforms / of functions / e L 1 n I?. By 
(1), Y <= L 2 . We claim that 7 is dense in I 2 , i.e„ that Y 1 = {0}. 

The functions x— » e ,xx H(Ax) are in L 1 n L 2 , for all real a and all 1 > 0. 
Their Fourier transforms 


h x (a. - t) 


■f 


e ic,x H(Xx)e~ ix ' dm{x) 


are therefore in Y. If w e 13, w e T 1 , it follows that 


(/j A * w)(a) 


-J'*- 


(a — t)iv(t) Jw(r) = 0 


( 8 ) 


(9) 


for all a. Hence w = 0, by Theorem 9.10, and therefore Y is dense in L 2 . 

Let us introduce the temporary notation <S)f for /. From what has been 
proved so far, we see that <I> is an L 2 -isometry from one dense subspace of L 2 , 
namely L 1 n L 2 , onto another, namely Y. Elementary Cauchy sequence argu- 
ments (compare with Lemma 4.16) imply therefore that extends to an 
isometry $ of L 2 onto L 2 . If we write /for 3>/, we obtain properties (a) and ( b ). 

Property (c) follows from ( b ), as in the proof of Theorem 4.18. The Par - 
seval formula 


% 00 
J- c 


f(x)g(x) dm(x) 


% 00 

= fmt) 

J — 00 


dm(t) 


(10) 


holds therefore for all / e L 2 and g e L 2 . 

To prove (d), let k A be the characteristic function of [ — A, A]. Then 
k A f e 1} n L 2 if/e L 2 , and 


<Pa = ff- 

Since || / — k A f || 2 — ► 0 as A — > oo, it follows from (b) that 
\\/-<Pa\\ 2= \\(f-k A ff\\ 2 -+0 

as A — > oo. 

The other half of (d) is proved the same way. 

9.14 Theorem Iff e L 2 and / e L 1 , then 


( 11 ) 

(12) 

llll 


fix) 


f(t)e ,xt dm(t ) a. o’. 
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Proof This is corollary of Theorem 9.13(6?). //// 

9.15 Remark If f e L 1 , formula 9.1(4) defines f(t) unambiguously for every t. If 
/ e L 2 , the Plancherel theorem defines / uniquely as an element of the Hilbert 
space L 2 , but as a point function f(t) is only determined almost everywhere. 
This is an important difference between the theory of Fourier transforms in 
l } and in L 2 . The indeterminacy of f(t) as a point function will cause some 
difficulties in the problem to which we now turn. 

9.16 Translation-Invariant Subspaces of L 2 A subspace M of L 2 is said to be 
translation-invariant if / e M implies that / a e M for every real a, where ffx) = 
f(x — a). Translations have already played an important part in our study of 
Fourier transforms. We now pose a problem whose solution will afford an illus- 
tration of how the Plancherel theorem can be used. (Other applications will occur 
in Chap. 19.) The problem is: 

Describe the closed translation-invariant subspaces of L 2 . 

Let M be a closed translation-invariant subspace of L 2 , and let M be the 
image of M under the Fourier transform. Then M is closed (since the Fourier 
transform is an L 2 - isometry). If f a is a translate of f the Fourier transform of f a is 
fe a , where eft) = e~ iat \ we proved this for / e L 1 in Theorem 9.2; the result 
extends to L 2 , as can be seen from Theorem 9.13(6?). It follows that M is invariant 
under multiplication by e a ,for all a e JR 1 . 

Let E be any measurable set in JR 1 . If M is the set of all cp e L 2 which vanish 
a.e. on E , then M certainly is a subspace of L 2 , which is invariant under multipli- 
cation by all e a (note that \e a \ = 1, so (pe a e L 2 if cp e L 2 ), and M is also closed. 
Proof : cp e M if and only if cp is orthogonal to every ij/ e L 2 which vanishes a.e. on 
the complement of E. 

If M is the inverse image of this M, under the Fourier transform, then M is a 
space with the desired properties. 

One may now conjecture that every one of our spaces M is obtained in this 
manner, from a set E a JR 1 . To prove this, we have to show that to every closed 
translation-invariant M L 2 there corresponds a set E a R l such that / e M if 
and only if f(t) = 0 a.e. on E. The obvious way of constructing E from M is to 
associate with each / e M the set E f consisting of all points at which / ( t ) = 0, and 
to define E as the intersection of these sets E f . But this obvious attack runs into 
a serious difficulty: Each E f is defined only up to sets of measure 0. If {A^ is a 
countable collection of sets, each determined up to sets of measure 0, then Q A t 
is also determined up to sets of measure 0. But there are uncountably many 
/ e Af, so we lose all control over P) E f . 

This difficulty disappears entirely if we think of our functions as elements of 
the Hilbert space L 2 , and not primarily as point functions. 

We shall now prove the conjecture. Let M be the image of a closed 
translation-invariant subspace M a L 2 , under the Fourier transform. Let P be the 
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orthogonal projection of L 2 onto M (Theorem 4.11): To each / e L 2 there corre- 
sponds a unique Pf e M such that f — Pf is orthogonal to M. Hence 

f-PfLPg (f and gel}) (1) 

and since M is invariant under multiplication by e a , we also have 

f- Pf -L ( p g)e* (/and gel}, a e K 1 ). (2) 

If we recall how the inner product is defined in L 2 , we see that (2) is equivalent to 
*00 

(/- Pf) • Pg • e-a dm = 0 (/and g e II, oc e R 1 ) (3) 

J— 00 

and this says that the Fourier transform of 

{f-Pf) Tg (4) 

is 0. The function (4) is the product of two L 2 -functions, hence is in L 1 , and the 
uniqueness theorem for _Fourier transforms shows now that the function (4) is 0 
a.e. This remains true if Pg is replaced by Pg. Hence 

f - Pg = (Pf) (Pg) (/ and g e L 2 ). (5) 

Interchanging the roles of / and g leads from (5) to 

f -Pg = g Pf (/ and g e I}). (6) 

Now let g be a fixed positive function in I}; for instance, put g(t) = e |r| . 
Define 


<p(t) = 


[PgM 

g(t) ' 


( 7 ) 


(Pg)(t) may only be defined a.e.; choose any one determination in (7). Now (6) 
becomes 


Pf=<P'f (/eL 2 ). (8) 

If f e M, then Pf — f This says that P 2 = P, and it follows that (p 2 — <p, 
because 

(p 2 g = cp * Pg = P 2 g = Pg = <p * g. (9) 

Since cp 2 = (p , we have cp = 0 or 1 a.e., and if we let E be the set of all t where 
cp{t) = 0, then M consists precisely of those / e L 2 which are 0 a.e. on £, since 
f e M if and only \if=Pf—cp • /. 

We therefore obtain the following solution to our problem. 
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9.17 Theorem Associate to each measurable set E a R 1 the space M E of all 
f e L 2 such that / = 0 a.e. on E. Then M E is a closed translation-invariant sub- 
space of L 2 . Every closed translation-invariant subspace of li is M E for some E , 
and M A = M B if and only if 

m((A - B) u (B - A)) = 0. 

The uniqueness statement is easily proved; we leave the details to the reader. 
The above problem can of course be posed in other function spaces. It has 
been studied in great detail in L 1 . The known results show that the situation is 
infinitely more complicated there than in L 2 . 

The Banach Algebra L 1 

9.18 Definition A Banach space A is said to be a Banach algebra if there is a 
multiplication defined in A which satisfies the inequality 

\\xy\\ < ||x|| Ml (x and y e A), (1) 

the associative law x(yz) = (xy)z, the distributive laws 

x(y 4- z) = xy 4- xz, (y + z)x = yx 4- zx (x, y, and z e A\ (2) 

and the relation 

(ax)y = x(ay) = a(xy) (3) 

where a is any scalar. 

9.19 Examples 

(a) Let A = C(X\ where X is a compact Hausdorff space, with the 
supremum norm and the usual pointwise multiplication of functions: 
(fd)( x ) = /(x)fif(x). This is a commutative Banach algebra ( fg = gf) with 
unit (the constant function 1). 

(b) CoiR 1 ) is a commutative Banach algebra without unit, i.e., without an 
element u such that uf = f for all / e CqCR 1 ). 

(c) The set of all linear operators on R k (or on any Banach space), with the 
operator norm as in Definition 5.3, and with addition and multiplication 
defined by 

(A + B)(x) = Ax 4- Bx, ( AB)x = A(Bx), 

is a Banach algebra with unit which is not commutative when k > 1. 

{d) l } is a Banach algebra if we define multiplication by convolution; since 

ll/*0lli<ll/lliMli, 

the norm inequality is satisfied. The associative law could be verified 
directly (an application of Fubini’s theorem), but we can proceed as 
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follows: We know that the Fourier transform of /* g is/ g, and we 
know that the mapping/— > /is one-to-one. For every t e R \ 

/(fM0%)] = U\t)g(t)](i(t), 

by the associative law for complex numbers. It follows that 
f *(g * h) = {f * g)* h. 

In the same way we see immediately that f * g = g *f The remaining 
requirements of Definition 9.18 are also easily seen to hold in L 1 . 

Thus L 1 is a commutative Banach algebra. The Fourier transform is 
an algebra isomorphism of L 1 into C 0 . Hence there is no / e L 1 with 
/si, and therefore L 1 has no unit. 

9.20 Complex Homomorphisms The most important complex functions on a 
Banach algebra A are the homomorphisms of A into the complex field. These are 
precisely the linear functionals which also preserve multiplication, i.e., the func- 
tions cp such that 

<p(ocx + py) = oup(x) + P<p{y), <p(xy) = (p(x)q>(y) 

for all x and y e A and all scalars a and /?. Note that no boundedness assumption 
is made in this definition. It is a very interesting fact that this would be 
redundant: 

9.21 Theorem If cp is a complex homomorphism on a Banach algebra A , then 
the norm of (p , as a linear functional , is at most 1. 

Proof Assume, to get a contradiction, that | <p(x 0 )| > ||x 0 || for some x 0 e A. 
Put X = <p(x 0 ), and put x = x 0 /A. Then ||x|| < 1 and cp(x) — 1. 

Since ||x w || < ||x|| n and ||x|| < 1, the elements 

s n =-x-x 2 x" (1) 

form a Cauchy sequence in A. Since A is complete, being a Banach space, 
there exists a y e A such that ||y — sj — > 0, and it is easily seen that x + s„ = 
xs n _ l9 so that 

x + y = xy. (2) 

Hence (p{x ) + q>{y ) = <p(x)<p(y), which is impossible since <p(x) = 1. //// 

9.22 The Complex Homomorphisms of L 1 Suppose cp is a complex homo- 
morphism of L 1 , i.e., a linear functional (of norm at most 1, by Theorem 9.21) 
which also satisfies the relation 


<p(f * g) = <P(fMg) (/ and g e L 1 ). 


( 1 ) 
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By Theorem 6.16, there exists a /? e L 00 such that 

<?(/)= f f(x)P(x) dm(x) ( fell t ). (2) 

J- OO 

We now exploit the relation (1) to see what else we can say about /?. On the one 
hand, 


<p(f 


g)(x)P(x) dm(x) 


* 9) = j if* 

J— 00 

= J P(x) dm(x) j* 

f* 00 f* 00 

= 0OO<M>O 

J — 00 J — c 

-r 


/?(x) dm(x) /(* - y)gf(y) dmCy) 

O J — 00 

^0) <im(» [ f y (x)P(x) dm(x) 
g(y)<Pify) dm(y). 


On the other hand. 


(3) 


<pif)<pig) = <p(f) f g(y)Piy) dm{y). (4) 

J- 00 

Let us now assume that (p is not identically 0. Fix / e L 1 so that <p(/ ) ^ 0. 
Since the last integral in (3) is equal to the right side of (4) for every gel}, the 
uniqueness assertion of Theorem 6.16 shows that 


cp(my) = <P(fy) (5) 

for almost all y. But y— >/ y is a continuous mapping of R 1 into L 1 (Theorem 9.5) 
and q> is continuous on L 1 . Hence the right side of (5) is a continuous function of 
y, and we may assume [by changing fi(y) on a set of measure 0 if necessary* which 
does not affect (2)] that ft is continuous. If we replace y by x + y and then /by f x 
in (5), we obtain 


<P(f)P(x + y) = cp(f x+y ) = cp((f x ) y ) = cp(fMy) = cp(f)p(x)p(y ), 


so that 


P(x + y) = P(x)P(y) (x and y e R 1 ). (6) 

Since ft is not identically 0, (6) implies that /?(0) = 1, and the continuity of ft 
shows that there is a d > 0 such that 

^Piy)dy = c¥= 0 . 


(7) 
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Then 


cm = ("mm d y = ("^ + X )d y = ( x+ "m d y . ( 8 ) 

Jo Jo Jx 

Since P is continuous, the last integral is a differentiable function of x; hence (8) 
shows that ft is differentiable. Differentiate (6) with respect to y, then put y = 0; 
the result is 


m = AM, a = m- (9) 

Hence the derivative of P(x)e~ Ax is 0 , and since /?( 0 ) = 1, we obtain 

p(x) = e Ax . (10) 

But P is bounded on JR 1 . Therefore A must be pure imaginary, and we conclude: 
There exists a t e R l such that 


P(x) = e~ itx . (11) 

We have thus arrived at the Fourier transform. 

9.23 Theorem To every complex homomorphism (p on L 1 (except to cp = 0) 
there corresponds a unique t e R l such that cp(f) = / ( t ). 

The existence of t was proved above. The uniqueness follows from the obser- 
vation that if t ^ s then there exists an / e L 1 such that f(t) ¥^f(s)\ take for f(x) a 
suitable translate of e~ |x| . 

Exercises 

1 Suppose / e L 1 ,/ > 0. Prove that \f{y) \ < /( 0) for every y # 0. 

2 Compute the Fourier transform of the characteristic function of an interval. For n = 1, 2, 3, let 
g n be the characteristic function of [ — n, n], let h be the characteristic function of [—1, 1], and 
compute g n * h explicitly. (The graph is piecewise linear.) Show that g n * h is the Fourier transform of 
a function f n e L 1 ; except for a multiplicative constant, 

„ sin x sin nx 


Show that II /„|| i — ► oo and conclude that the mapping/— ► / maps I) into a proper subset of C 0 . 
Show, however, that the range of this mapping is dense in C 0 . 

3 Find 

C A sin It 

lim e dt ( — oo < x < oo) 

A-* oo J-A * 


where A is a positive constant. 

4 Give examples of/ e L 2 such that / £ L 1 but /e L 1 . Under what circumstances can this happen? 
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5 If / e L 1 and J |t/(t) | dm{t ) < oo, prove that / coincides a.e. with a differentiable function whose 
derivative is 



6 Suppose /e L 1 ,/ is differentiable almost everywhere, and /' e L 1 . Does it follow that the Fourier 
transform of /' is tif(t)l 

7 Let S be the class of all functions /on R 1 which have the following property: /is infinitely differen- 
tiable, and there are numbers A mn (f) < oo, for m and n = 0, 1,2,..., such that 

I x n D m f(x) | < 4 „,,,(/) (xeK 1 ). 

Here D is the ordinary differentiation operator. 

Prove that the Fourier transform maps S onto S. Find examples of members of S. 

8 If p and q are conjugate exponents, / e L p , g e 13, and h =f * g, prove that h is uniformly contin- 
uous. If also 1 < p < oo, then h e C 0 ; show that this fails for some f e I}, g e L 00 . 

9 Suppose 1 < p < oo,/ e If, and 

Prove that g e C 0 . What can you say about g iff e L? ? 

10 Let C 00 be the class of all infinitely differentiable complex functions on R 1 , and let C c °° consist of 
all g e C 00 whose supports are compact. Show that C® does not consist of 0 alone. 

Let L} oc be the class of all / which belong to L 1 locally; that is,/ e L} oc provided that /is measur- 
able and Jj | / 1 < oo for every bounded interval /. 

If/e L} oc and # e C c °°, prove that / * g e C 00 . 

Prove that there are sequences {g n } in C® such that 


as n— > oo, for every /e L 1 . (Compare Theorem 9.10.) Prove that {g n } can also be so chosen that 
(/ * 0„Xx)— ^ ►/(*) a.e., for every / e L, 1 ^ ; in fact, for suitable {g n } the convergence occurs at every point 
x at which / is the derivative of its indefinite integral. 

Prove that (/ * h x )(x )^ ►/(*) a.e. if / e L 1 , as A— ► 0, and that f * h x e C 00 , although h x does not 
have compact support. (/z A is defined in Sec. 9.7.) 

11 Find conditions on / and/or / which ensure the correctness of the following formal argument: If 




dx 


and 


F(x)= X f(x + 2kn) 

k = - oo 

then F is periodic, with period 2n , the nth Fourier coefficient of F is <p(n), hence F(x) = £ <p(n)e inx . In 
particular, 


Z f( 2kn )= Z <?(")• 

k= — oo n= — oo 


oo oo 

Z = a Z <^( wa ) if a > 0, p > 0, ap = 2n. (*) 

k=-oo n=-oo 


More generally, 
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What does (*) say about the limit, as a— ► 0, of the right-hand side (for “nice” functions, of course)? Is 
this in agreement with the inversion theorem? 

[(*) is known as the Poisson summation formula.] 

12 Take f(x) = e~ M in Exercise 11 and derive the identity 

e 2na + 1 1 ® « 

e 2 ™- 1 _ 7T a 2 + n 2 ’ 

13 IfO < c < oo, define f c (x) = exp ( — cx 2 ). 

(a) Compute f e . Hint : If (p =f c , an integration by parts gives 2 c(p'{t) + t(p(t) = 0. 

(b) Show that there is one (and only one) c for which f c = / c . 

( c ) Show that f a *f b = yf c ; find y and c explicitly in terms of a and b. 

(i d ) Take f = f c in Exercise 11. What is the resulting identity? 

14 The Fourier transform can be defined for / e li(R k ) by 


/O') = I f(x)e~ ix > dm k (x) (y e R\ 

JR k 

where x • y = ]T ^ if x = ({ lt . . . , £ k ), y = foj, . . . , ?/ k ), and m k is Lebesgue measure on R k , divided by 
(27r) k/2 for convenience. Prove the inversion theorem and the Plancherel theorem in this context, as 
well as the analogue of Theorem 9.23. 

15 If f e L}(R k ), A is a. linear operator on R k , and g(x) = f(Ax), how is g related to /? If /is invariant 
under rotations, i.e., if f{x) depends only on the euclidean distance of x from the origin, prove that the 
same is true of f. 

16 The Laplacian of a function / on R k is 


A/= £ 

i- l 


Sxj’ 


provided the partial derivatives exist. What is the relation between / and g if g = A/ and all necessary 
integrability conditions are satisfied? It is clear that the Laplacian commutes with translations. Prove 
that it also commutes with rotations, i.e., that 


A(/o A) = (A/) o A 

whenever / has continuous second derivatives and A is a rotation of R k . (Show that it is enough to do 
this under the additional assumption that / has compact support.) 

17 Show that every Lebesgue measurable character of R l is continuous. Do the same for R k . (Adapt 
part of the proof of Theorem 9.23.) Compare with Exercise 18. 

18 Show (with the aid of the Hausdorff maximality theorem) that there exist real discontinuous func- 
tions /on R 1 such that 

f(x + y)=f(x)+f{y) (1) 

for all x and y e R 1 . 

Show that if (1) holds and /is Lebesgue measurable, then /is continuous. 

Show that if (1) holds and the graph of /is not dense in the plane, then /is continuous. 

Find all continuous functions which satisfy (1). 

19 Suppose A and B are measurable subsets of R l , having finite positive measure. Show that the 
convolution Xa * 1b continuous and not identically 0. Use this to prove that A + B contains a 
segment. 

(A different proof was suggested in Exercise 5, Chap. 7.) 
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TEN 


ELEMENTARY PROPERTIES OF 
HOLOMORPHIC FUNCTIONS 


Complex Differentiation 

We shall now study complex functions defined in subsets of the complex plane. It 
will be convenient to adopt some standard notations which will be used through- 
out the rest of this book. 

10.1 Definitions If r > 0 and a is a complex number, 

D(a;r) = {z:\z - a\ <r} (1) 

is the open circular disc with center at a and radius r. D(a ; r) is the closure of 
D(a ; r), and 

D'(a ; r) = {z: 0 < \ z — a \ < r} (2) 

is the punctured disc with center at a and radius r. 

A set £ in a topological space X is said to be not connected if E is the 
union of two nonempty sets A and B such that 

AnB = 0 = AnB. (3) 

If A and B are as above, and if V and W are the complements of A and 
5, respectively, it follows that A <^W and B a V. Hence 

E^V kjW, E n V =£0, E n W * 0, E n V n W = 0. (4) 

Conversely, if open sets V and W exist such that (4) holds, it is easy to 
see that E is not connected, by taking A = E n W, B = E n V. 

If E is closed and not connected, then (3) shows that E is the union of 
two disjoint nonempty closed sets; for if A cz A u B and A n B = 0, then 
A = A. 
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If E is open and not connected, then (4) shows that E is the union of two 
disjoint nonempty open sets, namely E n V and E n W. 

Each set consisting of a single point is obviously connected. If x e E, the 
family of all connected subsets of E that contain x is therefore not empty. 
The union of all members of is easily seen to be connected, and to be a 
maximal connected subset of E. These sets are called the components of E. Any 
two components of E are thus disjoint, and E is the union of its components. 

By a region we shall mean a nonempty connected open subset of the 
complex plane. Since each open set Q in the plane is a union of discs, and 
since all discs are connected, each component of Q is open. Every plane open 
set is thus a union of disjoint regions. The letter Q will from now on denote a 
plane open set. 


10.2 Definition Suppose / is a complex function defined in Q. If z 0 e Qand if 


lim 

Z-+ZQ 


/(*) -/(* o) 
z - z 0 


( 1 ) 


exists, we denote this limit by f'(z 0 ) and call it the derivative of /at z 0 . If f'(z 0 ) 
exists for every z 0 e Q, we say that / is holomorphic (or analytic) in Q. The 
class of all holomorphic functions in Q will be denoted by H(Q). 

To be quite explicit, f\z 0 ) exists if to every e > 0 there corresponds a 
3 > 0 such that 


m -f(z 0 ) 

z - z 0 


-/'(z o) 


< 6 


for all z e D'(z 0 ; 3). 


(2) 


Thus f'(z 0 ) is a complex number, obtained as a limit of quotients of 
complex numbers. Note that / is a mapping of Q into R 2 and that Definition 
7.22 associates with such mappings another kind of derivative, namely, a 
linear operator on R 2 . In our present situation, if (2) is satisfied, this linear 
operator turns out to be multiplication by f\z 0 ) (regarding R 2 as the complex 
field). We leave it to the reader to verify this. 


10.3 Remarks If / e H(Q) and g e H(Q\ then also / 4- g e H(Q) and 
fg e H(Q) 9 so that H{Q) is a ring; the usual differentiation rules apply. 

More interesting is the fact that superpositions of holomorphic functions 
are holomorphic: Iff e H(Q), if f(Q) a Q 1? if g e //(QJ, and ifh = g°f then 
h e H(Q\ and h' can be computed by the chain rule 

h’(z 0 ) = g'(f{z 0 ))f'(z 0 ) (z 0 e G). ( 1 ) 

To prove this, fix z 0 e £2, and put w 0 = f(z 0 ). Then 


/(z) /(z 0 ) = Lf(z 0 ) + e(z)](z - z 0 ), 

g(w) - g(w 0 ) = [ 0 '(w o ) + f/(w)](w - w 0 ), 


( 2 ) 

( 3 ) 
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where €(z)— ► 0 as z— ► z 0 and //(w)— ► 0 as w— ► w 0 . Put w = /(z), and substitute 
(2) into (3): If z # z 0 , 

— ~ /l(2 ° ) = [0'(/(z o )) + V(/(z))][/'(z 0 ) + <z)l (4) 

z — z 0 

The differentiability of / forces / to be continuous at z 0 . Hence (1) follows 
from (4). 

10.4 Examples For n = 0, 1, 2, . . . , z n is holomorphic in the whole plane, and 
the same is true of every polynomial in z. One easily verifies directly that 1/z 
is holomorphic in (z: z # 0}. Hence, taking g(w) = 1/w in the chain rule, we 
see that if / x and f 2 are in H(Q) and Q 0 is an open subset of Q in which f 2 has 
no zero, then fjf 2 e if(Q 0 ). 

Another example of a function which is holomorphic in the whole plane 
(such functions are called entire) is the exponential function defined in the 
Prologue. In fact, we saw there that exp is differentiable everywhere, in the 
sense of Definition 10.2, and that exp' (z) = exp (z) for every complex z. 

10.5 Power Series From the theory of power series we shall assume only one fact 
as known, namely, that to each power series 

I c „(2 - a) n (1) 

>1 = 0 

there corresponds a number R e [0, oo] such that the series converges absolutely 
and uniformly in D(a; r), for every r < R, and diverges if z 4 5(a ; R). The “radius 
of convergence” R is given by the root test: 

■^ = lim sup | cJ 1/B . (2) 

Let us say that a function / defined in Q is representable by power series in Q 
if to every disc D(a ; r) c= Q there corresponds a series (1) which converges to /(z) 
for all z e D(a ; r). 

10.6 Theorem Iff is representable by power series in Q, then f e H(Q) and /' is 
also representable by power series in Q. In fact , if 

m = 1 c n (z - af (1) 

>1 = 0 

for z e D(a ; r), then for these z we also have 

/'(z)= E nc„(z - af~K 
> 1=1 


( 2 ) 
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Proof If the series (1) converges in D(a ; r), the root test shows that the series 
(2) also converges there. Take a — 0, without loss of generality, denote the 
sum of the series (2) by g{z\ fix w e D(a ; r), and choose p so that | w | < p < r. 
If z 7 ^ w, we have 


m -m 


g(w) 


Z — W 

The expression in brackets is 0 if n— 1, and is 


= Z ~ nw • 

„ = i L z ~ w 


(z — w) £ kw k *z" 


— k — 1 


if n > 2. If | z | < p, the absolute value of the sum in (4) is less than 


(3) 

(4) 


”(”-!) -I,— 2 

2 ^ 


(5) 


SO 


< |z - W| Z n 2 l c »lp" 2 - ( 6 ) 

n = 2 

Since p < r, the last series converges. Hence the left side of ( 6 ) tends to 0 as 
z— ► w. This says that /'(w) = p(w), and completes the proof. //// 

Corollary Since /' is seen to satisfy the same hypothesis as f does , t/ie theorem 
can be applied to /'. It follows that f has derivatives of all orders , t/iat eacii 
derivative is representable by power series in Q, and t/iat 


00 


/ <k, (z) = I»(n - 1) • 

n = k 

■ (n — k + 1 )c B (z — a)" * 

(7) 

if(l) holds. Hence (1) implies that 

k\c k =f {k \a) 

(fc = 0, 1,2,...), 

(8) 


so t/iat /or eac/i a e Q there is a unique sequence {c n }for which ( 1 ) holds. 

We now describe a process which manufactures functions that are represent- 
able by power series. Special cases will be of importance later. 

10.7 Theorem Suppose p is a complex ( finite ) measure on a measurable space 
X , (p is a complex measurable function on X , Q is an open set in the plane 
which does not intersect (p(X\ and 

,, V f dp ( 0 

/(z) -i^cFT 

T/ien / is representable by power series in Q. 


/(z) -/(w) 

z — w 


0(w) 


(z e O). 


(1) 
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Proof Suppose D(a ; r) c= Q. Since 

z — a 




(p(Q-a\ 

for every z e D(a ; r) and every (el, the geometric series 
“ (z - ay 1 

B =o(<p(0-«r +1 


( 2 ) 


( 3 ) 


converges uniformly on X, for every fixed z e D(a; r). Hence the series (3) 
may be substituted into (1), and /(z) may be computed by interchanging sum- 
mation and integration. It follows that 


/(z) = Z c„(z - a)" (z 6 D(a; /•)) 


where 


-I 




(<p( o - «r 


(n = 0, 1, 2,...). 


( 4 ) 

( 5 ) 

till 


Note : The convergence of the series (4) in D(a; r) is a consequence of the 
proof. We can also derive it from (5), since (5) shows that 


kJ< 


\V\(X) 


(n = 0, 1, 2,...). 


( 6 ) 


Integration over Paths 

Our first major objective in this chapter is the converse of Theorem 10.6: Every 
f e H(Q) is representable by power series in Q. The quickest route to this is via 
Cauchy’s theorem which leads to an important integral representation of holo- 
morphic functions. In this section the required integration theory will be devel- 
oped; we shall keep it as simple as possible, and shall regard it merely as a useful 
tool in the investigation of properties of holomorphic functions. 

10.8 Definitions If AT is a topological space, a curve in AT is a continuous 
mapping y of a compact interval [a, /?] c= R l into X; here a < /?. We call 
[a, /?] the parameter interval of y and denote the range of y by y*. Thus y is a 
mapping, and y* is the set of all points y(t\ for a < t < /?. 

If the initial point y(a) of y coincides with its end point y(/?), we call y a 
closed curve. 

A path is a piecewise continuously differentiable curve in the plane. More 
explicitly, a path with parameter interval [a, /?] is a continuous complex 
function y on [a, /?], such that the following holds: There are finitely many 
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points Sj, a = s 0 < s t < • • • < s n = /?, and the restriction of y to each interval 
[Sj- t , sj] has a continuous derivative on ls j _ i , Sj ] ; however, at the points 
s l9 . . . , 5 „_ x the left- and right-hand derivatives of y may differ. 

A closed path is a closed curve which is also a path. 

Now suppose y is a path, and / is a continuous function on y*. The 
integral of / over y is defined as an integral over the parameter interval [a, /?] 
of y: 

|/(z) dz = j jmmt) dt. (i) 


Let (p be a continuously differentiable one-to-one mapping of an interval 
Oi, f}{\ onto [a, /?], such that cp(tx ^ = a, (piPJ = /?, and put y x = y ° q>. Then 
y x is a path with parameter interval [a l5 jS x ] ; the integral of / over yj is 


f(y 


iW)yi(f) dt = f j 

Jai 


f(y(<p(t)))y'(<p(t))<p'(t) dt = 


f(y( s ))y'(s) ds, 


so that our “ reparametrization ” has not changed the integral: 


f f(z) dz= f f(z) dz. 
Jy 1 Jy 


(2) 


Whenever (2) holds for a pair of paths y and y x (and for all /), we shall 
regard y and y t as equivalent. 

It is convenient to be able to replace a path by an equivalent one, i.e., to 
choose parameter intervals at will. For instance, if the end point of y x coin- 
cides with the initial point of y 2 , we may locate their parameter intervals so 
that y l and y 2 join to form one path y, with the property that 



( 3 ) 


for every continuous /on y* = yf u yf . 

However, suppose that [0, 1] is the parameter interval of a path y, and 
yi(0 = y(l — t\ 0 < t < 1. We call y l the path opposite to y, for the following 
reason: For any / continuous on yf = y*, we have 


/( 7 i(f))y'i(f) dt = 

Jo 


•l 

Jo 


f(y(i - 0)y'(i — t) dt = 


/(y(s))y'(s) ds, 
Jo 


so that 



(4) 
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From (1) we obtain the inequality 


I 


f(z) dz 


f 


1/01 du 


(5) 


where ||/|| ^ is the maximum of |/| on y* and the last integral in (5) is (by 
definition) the length of y. 


10.9 Special Cases 

(a) If a is a complex number and r > 0, the path defined by 

y(t) = a + re lt (0 < t < 2n) (1) 

is called the positively oriented circle with center at a and radius r; we have 


Jy 


f(z) dz = ir 


'In 


f(a + re id )e id dO , 


and the length of y is 2 nr, as expected. 

(b) If a and b are complex numbers, the path y given by 


( 2 ) 


y(t) = a 4- (b — a)t (0 < t < 1) 


(3) 


is the oriented interval [a, h] ; its length is | b — a | , and 

f(z) dz = (b — a) I /[a + (b - a)t ] dt. (4) 

J[a, b] JO 

If 


y i0 


a(P — t) 4- b(t — a) 

P — CL 


(a < t < P\ 


(5) 


we obtain an equivalent path, which we still denote by [a, b]. The path 
opposite to [a, b] is [b, a]. 

(c) Let {a, b , c} be an ordered triple of complex numbers, let 


A = A(a, b , c) 


be the triangle with vertices at a, b , and c (A is the smallest convex set which 
contains a, b, and c), and define 


[ /= [ /+ f /+ f /, (6) 

JdA J[a, b] J[b , c] J[c, a] 

for any / continuous on the boundary of A. We may regard (6) as the defini- 
tion of its left side. Or we may regard dA as a path obtained by joining [a, b] 
to [b, c] to [c, a], as outlined in Definition 10.8, in which case (6) is easily 
proved to be true. 
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If { a , b, c } is permuted cyclically, we see from (6) that the left side of (6) is 
unaffected. If {a, b, c } is replaced by { a , c, b}, then the left side of (6) changes 
sign. 

We now come to a theorem which plays a very important role in function 
theory. 

10.10 Theorem Let y be a closed path , let Q be the complement of y* ( relative 
to the plane), and define 

Ind > (z) - i I rb (zsn) (1) 

Then Ind y is an integer-valued function on Q which is constant in each com- 
ponent of Q and which is 0 in the unbounded component of Q. 

We call Ind y (z) the index of z with respect to y. Note that y* is compact, 
hence y* lies in a bounded disc D whose complement D c is connected; thus D c lies 
in some component of Q. This shows that Q has precisely one unbounded com- 
ponent. 

Proof Let [a, /?] be the parameter interval of y, fix z e Q, then 

1 f' y'(s) 

Indy (z) = — ds . (2) 

y 2m i y(s) - z 

Since w/2ni is an integer if and only if e w = 1, the first assertion of the 
theorem, namely, that Ind y (z) is an integer, is equivalent to the assertion that 
(p(f) = 1, where 

(p(t) = exp ~ (a < f < p). (3) 

Differentiation of (3) shows that 

m = _m_ 

(p(t) y(t) - z 

except possibly on a finite set S where y is not differentiable. Therefore 
(p/(y — z) is a continuous function on [a, /?] whose derivative is zero in 
[a, /?] — S. Since S is finite, cp/(y — z) is constant on [a, /?] ; and since 
cp{ a) = 1, we obtain 


We now use the assumption that y is a closed path, i.e., that y(fi) = y( a); 
(5) shows that cp{fi) = 1, and this, as we observed above, implies that Ind y (z) 
is an integer. 
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By Theorem 10.7, (1) shows that Ind y e H(Q). The image of a connected 
set under a continuous mapping is connected ([26], Theorem 4.22), and since 
Ind y is an integer-valued function, Ind y must be constant on each component 
of Q. 

Finally, (2) shows that | Ind y (z) | < 1 if | z | is sufficiently large. This 
implies that Ind y (z) = 0 in the unbounded component of Q. //// 

Remark: If 2(f) denotes the integral in (3), the preceding proof shows that 
2 n Indy (z) is the net increase in the imaginary part of A(r), as t runs from a to 
1 3 , and this is the same as the net increase of the argument of y(t) — z. (We 
have not defined “ argument ” and will have no need for it.) If we divide this 
increase by 2 n, we obtain “ the number of times that y winds around z,” and 
this explains why the term “winding number” is frequently used for the 
index. One virtue of the preceding proof is that it establishes the main 
properties of the index without any reference to the (multiple-valued) argu- 
ment of a complex number. 


10.11 Theorem If y is the positively oriented circle with center at a and radius 
r, then 


Indy (z) = 


if | z — a | < r, 
if | z — a | > r. 


Proof We take y as in Sec. 10.9(a). By Theorem 10.10, it is enough to 
compute Indy (a), and 10.9(2) shows that this equals 


2ni 



r 

2 n 



(re it )~ i e it dt 


= 1 . 


Ill / 


The Local Cauchy Theorem 

There are several forms of Cauchy’s theorem. They all assert that if y is a closed 
path or cycle in Q, and if y and Q satisfy certain topological conditions, then the 
integral of every / e H(Q) over y is 0. We shall first derive a simple local version 
of this (Theorem 10.14) which is quite sufficient for many applications. A more 
general global form will be established later. 

10.12 Theorem Suppose F e H(Q) and F' is continuous in Q. Then 

J F'(z) dz = 0 


for every closed path y in Q. 
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Proof If [a, /?] is the parameter interval of y, the fundamental theorem of 
calculus shows that 

f V(z) dz = fV'(y(t))y'(0 dt = - F(y(a)) = 0, 

Jy Joe 

since y(fi) = y(a). //// 

Corollary Since z n is the derivative of z n+i /(n + 1) for all integers n / — 1, we 
have 


z n dz = 0 


for every closed path y if n = 0, 1, 2, , and for those closed paths y for which 

0 £y*ifn= -2, -3, -4,.... 

The case n = — 1 was dealt with in Theorem 10.10. 


10.13 Cauchy’s Theorem for a Triangle Suppose A is a closed triangle in a 
plane open set Q,peQ,f is continuous on Q, and f e H(Q — {p}). Then 

f m dz = o. (i) 

J0A 

For the definition of dA we refer to Sec. 10.9(c). We shall see later that our 
hypothesis actually implies that / e H(Q), i.e., that the exceptional point p is not 
really exceptional. However, the above formulation of the theorem will be useful 
in the proof of the Cauchy formula. 

Proof We assume first that p $ A. Let a, b , and c be the vertices of A, let a', 
b\ and c' be the midpoints of [ b , c], [c, a], and [a, b], respectively, and con- 
sider the four triangles A j formed by the ordered triples 

(a, c', b (b, a', c'}, (c, b', a'}, (a', b', c'}. (2) 

If J is the value of the integral (1), it follows from 10.9(6) that 

j = i f m dz. (3) 

j= 1 JeA' 

The absolute value of at least one of the integrals on the right of (3) is there- 
fore at least | J/4\. Call the corresponding triangle A l5 repeat the argument 
with A x in place of A, and so forth. This generates a sequence of triangles A n 



206 REAL AND COMPLEX ANALYSIS 

such that AdA 1 dA 2 d ,, * > such that the length of dA n is 2 ~ n L, if L is the 
length of 5 A, and such that 

\J\ < 4" f f(z)dz (n= 1,2,3,...). (4) 

JeA„ 

There is a (unique) point z 0 which the triangles A n have in common. Since A 
is compact, z 0 e A, so /is differentiable at z 0 . 

Let € > 0 be given. There exists an r > 0 such that 

I f(z) - /(z o) - /'(zo)(z -z 0 )l <e|z-2 0 l (5) 

whenever | z — z 0 1 < r, and there exists an n such that | z — z 0 1 < r for all 
z e A„ . For this n we also have | z — z 0 1 < 2 ~ n L for all z e A„ . By the Corol- 
lary to Theorem 10.12, 

f /(z) dz = f [/(z) -/(z 0 ) -f'(z 0 )(z - z 0 )] (6) 

J0A„ JdA„ 

so that (5) implies 

I f f(z)dz < e(2~ n L) 2 , (7) 

I Je a„ 

and now (4) shows that \J \ < el}. Hence J =? 0 if p $ A. 

Assume next that p is a vertex of A, say p = a. If a, b, and c are collinear, 
then (1) is trivial, for any continuous /. If not, choose points x e [a, b] and 
y e [a, c], both close to a, and observe that the integral of / over dA is the 
sum of the integrals over the boundaries of the triangles (a, x, y} 9 (x, b , y}, 
and (h, c, y}. The last two of these are 0, since these triangles do not contain 
p. Hence the integral over dA is the sum of the integrals over [a, x], [x, y], 
and [y, a], and since these intervals can be made arbitrarily short and / is 
bounded on A, we again obtain (1). 

Finally, if p is an arbitrary point of A, apply the preceding result to 
(a, b , p}, { b , c, p}, and (c, a, p} to complete the proof. hh 

10.14 Cauchy’s Theorem in a Convex Set Suppose Q is a convex open set , 
peQ, / is continuous on Q, and f e — {p}). T/ien /= F' for some 
F g H(Q). Hence 

(f(z)dz = 0 (1) 


for every closed path y in Q. 
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Proof Fix a e Q. Since Q is convex, Q contains the straight line interval 
from a to z for every z e ft, so we can define 

m = f m (z e o). (2) 

J[fl, z) 

For any z and z Q e Q, the triangle with vertices at a, z 0 , and z lies in Q; 
hence F(z) — F(z 0 ) is the integral of / over [z 0 , z], by Theorem 10.13. Fixing 
z 0 , we thus obtain 


F(z)-F(z 0 ) „ , 1 

— : — : — -7 (*<>) = - — 


C/(0 -/( z o)3 dt, 

[zo, z] 


( 3 ) 


if z ^ z 0 . Given € > 0, the continuity of / at z 0 shows that there is a 5 > 0 
such that |/(^) — /(z 0 ) | < 6 if | ^ — z 0 1 < d ; hence the absolute value of the 
left side of (3) is less than 6 as soon as | z — z 0 1 < 5. This proves that / = F'. 
In particular, F e H(Q). Now (1) follows from Theorem 10.12. //// 


10.15 Cauchy’s Formula in a Convex Set Suppose y is a closed path in a 
convex open set Q, and f e H(Q). IfzeQ, and z $ y*, then 

at 

The case of greatest interest is, of course, Ind y (z) = 1. 

Proof Fix z so that the above conditions hold, and define 

if £ 6 fl, £ # Z, 

g(Z) = j (2) 

|/'(z) if Z = z. 

Then g satisfies the hypotheses of Theorem 10.14. Hence 

If we substitute (2) into (3) we obtain (1). //// 

The theorem concerning the representability of holomorphic functions by 
power series is an easy consequence of Theorem 10.15, if we take a circle for y: 


10.16 Theorem For every open set Q in the plane , every / e H(Q) is represent- 
able by power series in Q. 
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Proof Suppose / e H(Q) and D(a; R) cz Q. If y is a positively oriented circle 
with center at a and radius r < R, the convexity of D(a; R) allows us to apply 
Theorem 10.15; by Theorem 10.11, we obtain 

/(Z) = 2 (zeD(a;r)). ( 1 ) 

But now we can apply Theorem 10.7, with X = [0, 2k], cp = y, and 
d/i(t) = f(y(t))y'(t) dt , and we conclude that there is a sequence { c n } such that 

/(z) = Z c »(z - a ) B ( z e r ))- (2) 

n = 0 

The uniqueness of {c„} (see the Corollary to Theorem 10.6) shows that the 
same power series is obtained for every r < R (as long as a is fixed). Hence 
the representation (2) is valid for every z e D(a ; R ), and the proof is complete. 

//// 


Corollary Iffe H(Q), then f e H(Q). 

Proof Combine Theorems 10.6 and 10.16. //// 

The Cauchy theorem has a useful converse: 

10.17 Morera’s Theorem Suppose f is a continuous complex function in an 
open set Q such that 


f(z)dz = 0 
JdA 

for every closed triangle AcQ, Then f e H(Q). 

Proof Let V be a convex open set in Q. As in the proof of Theorem 10.14, 
we can construct F e H(V) such that F' = /. Since derivatives of holomorphic 
functions are holomorphic (Theorem 10.16), we have / e H(V), for every 
convex open Fcfi, hence / e H(Q). //// 

The Power Series Representation 

The fact that every holomorphic function is locally the sum of a convergent 
power series has a large number of interesting consequences. A few of these are 
developed in this section. 


10.18 Theorem Suppose Q is a region, f e H(Q), and 

Z(f) = {a 6 £i:f{a) = 0}. 


( 1 ) 
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Then either Z(f) = Q , or Z(f) has no limit point in Q. In the latter case there 
corresponds to each a e Z(f) a unique positive integer m = m(a) such that 

f(z) = (z - a) m g{z) (z e Cl), (2) 

where g e H(Q) and g(a) i=- 0; furthermore , Z(f) is at most countable . 


(We recall that regions are connected open sets.) 

The integer m is called the order of the zero which / has at the point a. 
Clearly, Z(/) = Q if and only if / is identically 0 in Q. We call Z(/) the zero set of 
f Analogous results hold of course for the set of a-points of f i.e., the zero set of 
/ — a, where a is any complex number. 


Proof Let A be the set of all limit points of Z(/) in Q. Since / is continuous, 
A c Z(/). 

Fix a e Z(f) 9 and choose r > 0 so that D(a ; r) c= Q. By Theorem 10.16, 


OO 


/(z)= Z C n( Z ~ «)" 
n = 0 


(z e D(a ; r)). 


(3) 


There are now two possibilities. Either all c„ are 0, in which case D(a; r) c A 
and a is an interior point of A, or there is a smallest integer m [necessarily 
positive, since f(a) = 0] such that c m 0. In that case, define 


j(z - a) m f{z) (z eCl- {a}), 
(c m (z = a). 


(4) 


Then (2) holds. It is clear that g e H(Cl — {a}). But (3) implies 


0(z) = 


OO 


I c m+k {z - a) k 

k = 0 


(z e D(a ; r)). 


(5) 


Hence g e H(D(a; r)), so actually g e H(Q). 

Moreover, g(a) i=- 0, and the continuity of g shows that there is a neigh- 
borhood of a in which g has no zero. Thus a is an isolated point of Z(/), by 
( 2 ). 

If a e A, the first case must therefore occur. So A is open. If B = Q — A, 
it is clear from the definition of A as a set of limit points that B is open. Thus 
Q is the union of the disjoint open sets A and B. Since Q is connected, we 
have either A = Q, in which case Z(/) = Q, or A = 0. In the latter case, 
Z(/) has at most finitely many points in each compact subset of Q, and since 
Q is a-compact, Z(/) is at most countable. //// 


Corollary Iff and g are holomorphic functions in a region Q and if f(z) = g(z) 
for all z in some set which has a limit point in Q, then f(z) = g{z)for all zeQ. 



a a 
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In other words, a holomorphic function in a region Q is determined by its 
values on any set which has a limit point in Q. This is an important uniqueness 
theorem. 

Note: The theorem fails if we drop the assumption that Q is connected: If 
= Q 0 u O l9 and Q 0 and are disjoint open sets, put f = 0 in Q 0 and / = 1 in 

i* 


10.19 Definition If a e Q and / e H(Q — {a}), then / is said to have an iso- 
lated singularity at the point a. If / can be so defined at a that the extended 
function is holomorphic in Q, the singularity is said to be removable. 

10.20 Theorem Suppose f e H(Q — {a}) and f is bounded in D'(a ; r\for some 
r > 0. Then f has a removable singularity at a. 

Recall that D'(a ; r) = {z: 0 < | z — a \ < r}. 

Proof Define h(a) = 0, and h(z) = (z — a) 2 f(z) in Q — {a}. Our boundedness 
assumption shows that h'(a) = 0. Since h is evidently differentiable at every 
other point of Q, we have h e H(Q), so 

h(z) = X c„(z - a)" (z e D(a; rj). 

n — 2 

We obtain the desired holomorphic extension of /by setting /(a) = c 2 , for 
then 


/(z) = 


OO 


Z c„ + 2 (z - a) n 

n = 0 


(z 6 D(a; r)). 


Ill 1/ 


10.21 Theorem If a e Cl, and f e H(Cl — {a}), then one of the following three 
cases must occur: 

(a) / has a removable singularity at a. 

(b) There are complex numbers c t , ..., c m , where m is a positive integer and 
c m 7^ 0, such that 


/(z) - 


z 


k=l 


c k 

(z - af 


has a removable singularity at a. 

(c) If r > 0 and D(a; r) a Q, then f(D'(a ; r)) is dense in the plane. 


In case (h),/ is said to have a pole of order m at a. The function 


Z C k( z ~ a ) k , 


k= 1 
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a polynomial in (z — a) -1 , is called the principal part of / at a. It is clear in this 
situation that | /(z) | — ► oo as z — ► a. 

In case (c), / is said to have an essential singularity at a. A statement equiva- 
lent to (c) is that to each complex number w there corresponds a sequence {z„} 
such that z n — * a and /(zj— ► w as n— ► oo. 

Proof Suppose ( c ) fails. Then there exist r > 0, S > 0, and a complex 
number w such that | f(z) — w \ > S in D'(a; r). Let us write D for D(a ; r) and 
D' for D'(a ; r). Define 


0(2) = 777 (z e £>')• (1) 

f(z) - w 

Then g e H(D') and \g\ < 1 /S. By Theorem 10.20, g extends to a holo- 
morphic function in D . 

If g(a) # 0, (1) shows that / is bounded in D'(a ; p) for some p > 0. Hence 
(a) holds, by Theorem 10.20. 

If g has a zero of order m > 1 at a, Theorem 10.18 shows that 


gf(z) = (z - a) M 0 ,(z) (z e D), (2) 

where g t e H(D) and gr,(a) # 0. Also, </, has no zero in Z)', by (1). Put h = 
1/gfj in D. Then h e H(D), h has no zero in D, and 

f(z) -W~(z- a)~ m h(z) (z e £>'). (3) 

But h has an expansion of the form 

h(z) = X b n (z - a) n (z e D), (4) 

n = 0 

with b 0 # 0. Now (3) shows that ( b ) holds, with c k = b m _ k ,k = 1, . . . , m. 

This completes the proof. //// 


We shall now exploit the fact that the restriction of a power series 
£ c n (z — a) n to a circle with center at a is a trigonometric series. 

10.22 Theorem If 

f(z) = X c„(z - a)" (z e D(a; R)) (1) 

n — 0 

and if0<r<R , r/ien 

I lc.lV- 7 - f“ \f(a + re i9 )\ 2 d6. 

n = 0 2 71 J-n 


(2) 



212 REAL AND COMPLEX ANALYSIS 


Proof We have 

f(a + re w ) = £ c„ r " e ‘" e - (3) 

n = 0 

For r < R, the series (3) converges uniformly on [ — n, n']. Hence 


1 


c m r" = — | f(a + re ie )e~ ine d6 (n = 0, 1, 2, . . .), (4) 

K 

and (2) is seen to be a special case of Parseval’s formula. //// 

Here are some consequences : 

10.23 Liouville’s Theorem Every bounded entire function is constant . 

Recall that a function is entire if it is holomorphic in the whole plane. 

Proof Suppose / is entire, | f(z) \ < M for all z, and f(z) = £ c n z n for all z. 
By Theorem 10.22, 


Z |c„|V"<M 2 

n = 0 

for all r, which is possible only if c n = 0 for all n > 1. //// 

10.24 The Maximum Modulus Theorem Suppose Q is a region ,/ e H(Q), and 
D(a ; r) c Q. Then 

| f(a) | < max \f(a + re ie ) | . (1) 

e 


Equality occurs in ( 1) if and only if f is constant in Q. 

Consequently, | / 1 has no local maximum at any point of Q, unless / is con- 
stant. 

Proof Assume that | f(a + re id ) \ <\f (a) \ for all real 6 . In the notation of 
Theorem 10.22 it follows then that 

Z I c n\ 2 r 2n < I f{a) | 2 = I C 0 1 2 . 

n = 0 

Hence = c 2 = c 3 = • = 0, which implies that /(z) =f{a) in D(a ; r). Since 

Q is connected, Theorem 10.18 shows that /is constant in Q. //// 

Corollary Under the same hypotheses , 

I f(a) | > min \f(a + re i9 ) \ (2) 

e 


if f has no zero in D(a; r). 



ELEMENTARY PROPERTIES OF HOLOMORPHIC FUNCTIONS 213 


Proof If f(a + re 10 ) = 0 for some 0 then (2) is obvious. In the other case, 
there is a region Q 0 that contains D(a ; r) and in which / has no zero; hence 
(1) can be applied to l/f and (2) follows. //// 


10.25 Theorem If n is a positive integer and 

P(z) = z n H- a n _ ±z n 1 H- • * • H- a H- a 0 , 

where a 0 , . . . , a n _ x are complex numbers , t/ien P has precisely n zeros in the 
plane. 

Of course, these zeros are counted according to their multiplicities : A zero of 
order m, say, is counted as m zeros. This theorem contains the fact that the 
complex field is algebraically closed, i.e., that every nonconstant polynomial with 
complex coefficients has at least one complex zero. 

Proof Choose r > 1 + 2 1 a 0 1 + | cq | + • • • + | x | . Then 
| P(re i0 ) | > | P(0) | (0 < 0 < 2n). 

If P had no zeros, then the function /= 1/P would be entire and would 
satisfy |/(0)| > \f(re i0 )\ for all 0, which contradicts the maximum modulus 
theorem. Thus P(zi) = 0 for some z v Consequently, there is a polynomial Q, 
of degree n — 1, such that P(z) = (z — z l )Q(z). The proof is completed by 
induction on n. //// 


10.26 Theorem (Cauchy’s Estimates) If f e H(D(a; R )) and |/(z)| < M for all 
z e D(a ; R\ then 


n ! M 

\f M (a)\<— (n = 1,2,3,...). (1) 


Proof For each r < R, each term of the series 10.22(2) is bounded above by 

M 2 . //// 

If we take a = 0, R = 1, and f(z) = z w , then M = 1, f {n \ 0) = nl, and we see 
that (1) cannot be improved. 


10.27 Definition A sequence {/,} of functions in Q is said to converge to f 
uniformly on compact subsets of Q if to every compact K Q and to every 
e > 0 there corresponds an N = N(K 9 e) such that | ffz) — f(z)\ < e for all 
zeKiij>N. 

For instance, the sequence { z n } converges to 0 uniformly on compact 
subsets of D( 0; 1), but the convergence is not uniform in D( 0; 1). 
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It is uniform convergence on compact subsets which arises most natu- 
rally in connection with limit operations on holomorphic functions. The term 
“ almost uniform convergence ” is sometimes used for this concept. 

10.28 Theorem Suppose f e H(Q\for j = 1, 2, 3, ..., and /}— ►/ uniformly on 
compact subsets of Q. Then f e H(Q\ and /'■— > /' uniformly on compact subsets 

of a. 

Proof Since the convergence is uniform on each compact disc in Q, / is 
continuous. Let A be a triangle in Q. Then A is compact, so 

I / (z) dz = lim j fj{z) dz = 0, 

Jd A j-* oo J0A 

by Cauchy’s theorem. Hence Morera’s theorem implies that / e H(Q). 

Let K be compact, K c= Q. There exists an r > 0 such that the union E of 
the closed discs D(z; r), for all z e K, is a compact subset of Q. Applying 
Theorem 10.26 to/ — we have 

|/'(z)-//z)|<r- 1 ||/-/ i l| £ (zeK), 

where \\f\\ E denotes the supremum of | / 1 on E. Since f } — > / uniformly on E, it 
follows that f'j-+f' uniformly on K. //// 

Corollary Under the same hypothesis, f ( p^>f {n) uniformly, as j—> oo, on every 
compact set K a Q, and for every positive integer n. 

Compare this with the situation on the real line, where sequences of infinitely 
differentiable functions can converge uniformly to nowhere differentiable func- 
tions ! 


The Open Mapping Theorem 

If Q is a region and f e H(Q), then f (Q) is either a region or a point . 


This important property of holomorphic functions will be proved, in more 
detailed form, in Theorem 10.32. 


10.29 Lemma Iff e H(Q) and g is defined in Q x Q by 

I f(z) ~/(w) 

9{z, w) = \ 


z — w 
f\z) 


if w 7 ^ z, 
if w = z, 


then g is continuous in Q x Q. 
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Proof The only points (z, w) e H x Q at which the continuity of g is poss- 
ibly in doubt have z = w. 

Fix a e Q. Fix € > 0. There exists r > 0 such that D(a; r) a Q and 

I /'( 0 — f'ip) | < € for all C e D(a ; r). If z and w are in D(a ; r) and if 

C(t) = (1 — t)z + tw, 

then C(0 e D(a ; r) for 0 < t < 1, and 

g(z, w) - g(a, a) = j* [/'(C(0) -/Ml <*f. 

The absolute value of the integrand is < €, for every t. Thus 

| g(z , w) — #(a, a) | < €. This proves that g is continuous at (a, a). //// 

10.30 Theorem Suppose (p e H(Q), z 0 e Q, and (p'(z 0 ) ^ 0. TTien Q contains a 
neighborhood V of z 0 such that 

(a) cp is one-to-one in V , 

(i b ) IF = cp(V) is an open set , and 

(c) if ij/: V is defined by i l/((p(z)) = z, tften ij/ e H(IF). 

Thus <p : F — > IF has a holomorphic inverse. 

Proof Lemma 10.29, applied to cp in place of f shows that Q contains a 
neighborhood F of z 0 such that 

| q>{z Y ) - <p(z 2 ) | > i | <p'(z 0 ) I Ul - 1 (1) 

if Zj g F and z 2 e F. Thus (a) holds, and also 

p'(z) ^0 (z e F). (2) 

To prove (b), pick a e V and choose r > 0 so that D{a , r) F. By (1) 
there exists c > 0 such that 

| (p{a + re 10 ) — <p(a) | > 2c ( — 7r < 6 < n). (3) 

If X e D(<p(a); c), then 1 2 — (p{a ) | < c, henCe (3) implies 

min | X — (p{a 4- re 10 ) | > c. (4) 

e 

By the corollary to Theorem 10.24, X — ip must therefore have a zero in 
D(a ; r). Thus X = cp(z) for some z e D{a ; r) F. 

This proves that D((p(a); c) c= (p(V). Since a was an arbitrary point of F, 
(p{ V) is open. 

To prove (c), fix w x e IF. Then q>{z Y ) = w x for a unique z t e F. If w e IF 
and i/^(w) = z e F, we have 

tfr(w) - ^(Wj) _ z - Zj 
W-Wi q>(z) - ^(Zj)' 


(5) 
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By (1), z— ^ when w— ► w v Hence (2) implies that i/f'(wi) = l/<p'( z i)- Thus 
e H(W). //// 

10.31 Definition For m = 1, 2, 3, we denote the “m th power function” 
z — ► z m by 7r m . 

Each w 7 ^ 0 is 7r m (z) for precisely m distinct values of z : If w = re ld , r > 0, 
then n m (z) = w if and only if z = r 1 /m e l(e + 2k7c)/m , k— 1 , . . . , m. 

Note also that each is an open mapping: If K is open and does not 
contain 0, then n m (V) is open by Theorem 10.30. On the other hand, 
n m (D(0; r)) = D( 0; r"). 

Compositions of open mappings are clearly open. In particular, n m o q> is 
open, by Theorem 10.30, if cp' has no zero. The following theorem (which 
contains the more detailed version of the open mapping theorem that was 
mentioned prior to Lemma 10.29) states a converse: Every nonconstant holo- 
morphic function in a region is locally of the form n m o cp, except for an 
additive constant. 

10.32 Theorem Suppose Q is a region , / e H( Q), / is not constant , z 0 e Q, and 
w 0 = f (z 0 ). Let m be the order of the zero which the function f — w 0 has at z 0 . 

Then there exists a neighborhood V of z 0 , FcQ, and there exists 
cp e H(V), such that 

(a) f(z) = w 0 + l(p(z)YforallzeV 9 

(b) cp' has no zero in V and cp is an invertible mapping of V onto a disc D( 0; r). 

Thus / — vv 0 = n m o cp in V. It follows that / is an exactly m-to-1 mapping of 
V — {z 0 } onto D'(w 0 ; r m ), and that each w 0 e f(Q) is an interior point of /( Q). 
Hence / (Q) is open. 

Proof Without loss of generality we may assume that Q is a convex neigh- 
borhood of z 0 which is so small that /(z) ^ vv 0 if z e Q — {z 0 }. Then 

/(z) - w 0 = (z - z 0 ) m g(z) (z G Q) (1) 

for some g e H( Q) which has no zero in Q. Hence gfg e H( Q). By Theorem 
10.14, g' / g = h! for some h e H( Q). The derivative of g • exp ( — h) is 0 in Q. If 
h is modified by the addition of a suitable constant, it follows that 
g = exp (h). Define 


(p(z) = (z — z 0 ) exp (z e Q). 


( 2 ) 


Then (a) holds, for all z e Q. 

Also, (p(z 0 ) = 0 and (p'(z 0 ) ^ 0. The existence of an open set V that 
satisfies ( b ) follows now from Theorem 10.30. This completes the proof. //// 
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The next theorem is really contained in the preceding results, but it seems 
advisable to state it explicitly. 

10.33 Theorem Suppose Q is a region ,/ e H( Q), and f is one-to-one in Q. Then 
f'(z) i=- 0 for every zeQ, and the inverse of f is holomorphic. 

Proof If f'(z 0 ) were 0 for some z 0 e Q, the hypotheses of Theorem 10.32 
would hold with some m > 1, so that / would be m-to-1 in some deleted 
neighborhood of z 0 . Now apply part (c) of Theorem 10.30. //// 

Note that the converse of Theorem 10.33 is false: If /(z) = e z , then f'(z) j=- 0 
for every z, but /is not one-to-one in the whole complex plane. 

The Global Cauchy Theorem 

Before we state and prove this theorem, which will remove the restriction to 
convex regions that was imposed in Theorem 10.14, it will be convenient to add a 
little to the integration apparatus which was sufficient up to now. Essentially, it is 
a matter of no longer restricting ourselves to integrals over single paths, but to 
consider finite “sums” of paths instead. A simple instance of this occurred 
already in Sec. 10.9(c). 

10.34 Chains and Cycles Suppose y l9 ..., y n are paths in the plane, and put K = 
yf u • • • u y*. Each y t induces a linear functional y f on the vector space C(K\ by 
the formula 


Define 


Uf) = 


f(z) dz. 

Jyi 


(i) 


r = y, + ••• + ?„. (2) 

Explicitly, f(/ ) = y^f) + • • • + y„(f) for all / e C(K). The relation (2) suggests 
that we introduce a “ formal sum ” 


r = y t + • • • + y„ 

and define 

j/(z) dz = f(/). 

Then (3) is merely an abbreviation for the statement 

f f(z) dz = t f /(*) dz (feC(K)). 
Jr i=l Jyi 

Note that (5) serves as the definition of its left side. 


( 3 ) 

( 4 ) 

( 5 ) 
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The objects T so defined are called chains. If each yj in (3) is a closed path, 
then T is called a cycle. If each y } in (3) is a path in some open set Q, we say that 
T is a chain in Q. 

If (3) holds, we define 


r* = yf u • * • u y*. (6) 

If T is a cycle and a £ T*, we define the index of a with respect to T by 


Ind r (a) = f , 
2ni J r z — a 

just as in Theorem 10.10. Obviously, (3) implies 

Ind r (a) = £ Ind„ (a). 


( 7 ) 


( 8 ) 


If each y t in (3) is replaced by its opposite path (see Sec. 10.8), the resulting 
chain will be denoted by — T. Then 


1 


/(z) dz = - 


f(z)dz (f e C(T*)). 


( 9 ) 


In particular, Ind_ r (a) = — Ind r (a) if T is a cycle and a T*. 

Chains can be added and subtracted in the obvious way, by adding or sub- 
tracting the corresponding functionals: The statement T = Tj + T 2 means 


/(z) dz 


f f(z)dz+ f f(z)dz 
Jr i Jr 2 


(10) 


for every / e C(r? u Tf ). 

Finally, note that a chain may be represented as a sum of paths in many 
ways. To say that 

4- • * • + y„ = 4- • • • -f S k 


means simply that 



/(z) dz = 


I 


/(z) dz 
J> j 


for every / that is continuous on yf u • • • u y* u Sf u • • • u d$. In particular, a 
cycle may very well be represented as a sum of paths that are not closed. 


10.35 Cauchy’s Theorem Suppose f e H( Q), where Q is an arbitrary open set in 
the complex plane. If T is a cycle in Q that satisfies 


Ind r (a) = 0 for every a not in Q, 


(1) 
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then 


m ■ Ind; 


and 


2m J r w — z 


1 


dw for z e Q — T* 


/(z) dz = 0. 

lfF 0 and r x are cycles in Q such that 

Ind ro (a) = Ind ri (a) for every a not in Q, 


then 


f f(z) dz = f f(z) dz. 

Jr 0 Jri 


(2) 


( 3 ) 


( 4 ) 


( 5 ) 


Proof The function g defined in Q x Q by 

|/(w)-/( 2 ) 

g(z, w) = 


if w 2, 


w — z 

I f'(z) if w = z, 

is continuous infi x fi (Lemma 10.29). Hence we can define 


h(z) = 

2m 


g(z , w) dw (z e Q). 


(6) 


( 7 ) 


For z e Q - T*, the Cauchy formula (2) is clearly equivalent to the assertion 
that 


m = o. (8) 

To prove (8), let us first prove that h e H( Q). Note that g is uniformly 
continuous on every compact subset of Q x Q. If z e Q, z„ e Q, and z n — ► z, it 
follows that g(z n , w)— ► g(z, w) uniformly for weP (a compact subset of Q). 
Hence h(z n )^> h(z). This proves that h is continuous in Q. Let A be a closed 
triangle in Q. Then 


L 


* 2) * - 25 


g(z , w) 6?z ) dw. 


( 9 ) 


For each weQ,z-> g(z , w) is holomorphic in Q. (The singularity at z = w is 
removable.) The inner integral on the right side of (9) is therefore 0 for every 
w e T*. Morera’s theorem shows now that h e H(Zl). 
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Next, we let be the set of all complex numbers z for which Ind r (z) = 
0, and we define 

M*) = ^ f — dw (z e Q t ). (10) 

2m J r w — z 

If 2 6 Q n Q l9 the definition of makes it clear that h t (z) = h(z). Hence 
there is a function cp e H(Q u Q x ) whose restriction to Q is /i and whose 
restriction to is h v 

Our hypothesis (1) shows that contains the complement of Q. Thus cp 
is an entire function. also contains the unbounded component of the com- 
plement of T*, since Ind r (z) is 0 there. Hence 

lim cp(z) = lim h x (z) = 0. (11) 

|z|->oo |z| — *• 00 

Liouville’s theorem implies now that cp(z) = 0 for every z. This proves (8), and 
hence (2). 

To deduce (3) from (2), pick a e Q — T* and define F(z) = (z — a)f(z). 
Then 

f f(z) dz = r” (* dz = F(a) • Indr («) = 0, (12) 

2m J r 27a J r z — a 

because F(a) = 0. 

Finally, (5) follows from (4) if (3) is applied to the cycle T = T x — T 0 . 
This completes the proof. //// 

10.36 Remarks 

(a) If y is a closed path in a convex region Q and if a $ Q, an application of 
Theorem 10.14 to /(z) = (z — a) -1 shows that Ind y (a) = 0. Hypothesis 
(1) of Theorem 10.35 is therefore satisfied by every cycle in Q if Q is 
convex. This shows that Theorem 10.35 generalizes Theorems 10.14 and 
10.15. 

(b) The last part of Theorem 10.35 shows under what circumstances integra- 
tion over one cycle can be replaced by integration over another, without 
changing the value of the integral. For example, let Q be the plane with 
three disjoint closed discs D t removed. If T, y u y 2 , y 3 are positively 
oriented circles in Q such that T surrounds u D 2 u D 3 and y t sur- 
rounds D t but not Dj for j # i, then 

f f(z) dz= X /(z) dz 
Jr i= i Jy; 

for every / e H(Q). 

( c ) In order to apply Theorem 10.35, it is desirable to have a reasonably 
efficient method of finding the index of a point with respect to a closed 
path. The following theorem does this for all paths that occur in practice. 
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It says, essentially, that the index increases by 1 when the path is crossed 
“ from right to left.” If we recall that Ind y (a) = 0 if a is in the unbounded 
component of the complement W of y*, we can then successively deter- 
mine Ind y (a) in the other components of W , provided that W has only 
finitely many components and that y traverses no arc more than once. 

10.37 Theorem Suppose y is a closed path in the plane , with parameter interval 
[a, /?]. Suppose cn<u<v<P,a and b are complex numbers , | b \ = r > 0, and 

(i) y(u) = a — b, y(v) = a + b, 

(ii) | y(s) — a \ < r if and only if u < s < v, 

(iii) | y(s) — a \ — r if and only if s = u or s = v. 

Assume furthermore that D(a; r) — y* is the union of two regions , D + and 
D_ , labeled so that a + bi e D+ and a — bi e D_ . Then 

Indy (z) = 1 + Indy (w) 

if x e D + and weD_. 

As y(t) traverses D(a ; r) from a — b to a + b, D_ is “on the right” and 
D + is “ on the left ” of the path. 


Proof To simplify the writing, reparametrize y so that u = 0 and v = n. 
Define 


f(s) = 


- be is 

(0 < s < 2 n) 

C(s) 

(0 < s < n) 

y(2n — s) 

(n < s < 2n) 

y(s) 

(0 < s < n) 

C(s) 

(n < s < 2n) 

y(s) 

(a < s < 0 or n < s < P) 

C(s) 

(0 < s < n). 


Since y(0) = C(0) and y(n) = C(n\f g , and h are closed paths. 

If E c: D(a; r), | f — a \ = r, and £ £ £, then E lies in the disc D(2a — £; 2r) 
which does not contain £. Apply this to E = g*, £ = a — bi , to see [from 
Remark 10.36(a)] that Ind^ (a — bi) = 0. Since D_ is connected and D_ does 
not intersect g *, it follows that 

Ind^ (w) = 0 i/weD_. (1) 

The same reasoning shows that 


Indy (z) = 0 if z g D+ . 


(2) 
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We conclude that 

Ind y (z) = Ind* (z) = Ind* (w) 

= Ind c (w) + Indy (w) = 1 + Ind y (w). 

The first of these equalities follows from (2), since h = y 4 - / The second holds 
because z and w lie in D(a ; r), a connected set which does not intersect h*. 
The third follows from (1), since h + g = C 4 - y 9 and the fourth is a conse- 
quence of Theorem 10.1 1. This completes the proof. //// 

We now turn to a brief discussion of another topological concept that is 
relevant to Cauchy’s theorem. 

10.38 Homotopy Suppose y 0 and y t are closed curves in a topological space AT, 
both with parameter interval / = [0, 1]. We say that y 0 and y Y are X-homotopic if 
there is a continuous mapping H of the unit square I 2 = I x / into X such that 

H(s 9 0) = y 0 (s) 9 H(s 9 1) = yfs) 9 H( 0, t) = H( 1, t) (1) 

for all s e I and tel. Put y f ( s ) = H(s 9 t). Then (1) defines a one-parameter family 
of closed curves y t in X 9 which connects y 0 and y v Intuitively, this means that y 0 
can be continuously deformed to y l9 within X. 

If y 0 is AT-homotopic to a constant mapping y l (i.e., if yX consists of just one 
point), we say that y 0 is null-homotopic in X. If X is connected and if every closed 
curve in X is null-homotopic, X is said to be simply connected. 

For example, every convex region Q is simply connected. To see this, let y 0 be 
a closed curve in Q, fix z x e Q, and define 

H(s 9 t) = (1 — t)y 0 (s) + tz x (0 < s < 1, 0 < t < 1). (2) 

Theorem 10.40 will show that condition (4) of Cauchy’s theorem 10.35 holds 
whenever T 0 and are Q-homotopic closed paths. As a special case of this, note 
that condition (1) of Theorem 10.35 holds for every closed path T in QifQ is simply 
connected. 

10.39 Lemma If y 0 and y t are closed paths with parameter interval [0, 1], if a 
is a complex number , and if 

\yi(s)-y 0 (s)\<\*-y 0 (s)\ ( 0 <s<i) (l) 

then Ind yi (a) = Indy 0 (a). 

Proof Note first that (1) implies that a £ yg and a £ yf. Hence one can 
define y = (y x — a )/(y 0 — a). Then 

y' _ yj y'o 
y ri - a r 0 - « 


( 2 ) 
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and 1 1 — y | < 1, by (1). Hence y* c= D( 1 ; 1), which implies that Ind y (0) = 0. 


Integration of (2) over [0, 1] now gives the desired result. //// 

10.40 Theorem IfT 0 and r x are Q-homotopic closed paths in a region Q, and if 
oc $ Q, then 

Ind Fl (a) = Ind ro (a). (1) 

Proof By definition, there is a continuous H : I 2 — ► Q such that 

H(s, 0) = r 0 (s), H(s, 1) = r^s), H( 0, t) = H( 1, t). (2) 

Since I 2 is compact, so is H(I 2 ). Hence there exists e > 0 such that 

| a — H(s , t) | > 2e if (s, t) e I 2 . (3) 

Since H is uniformly continuous, there is a positive integer n such that 

| H(s 9 t) - H(s\ t')\ < e if | s - s' | + 1 1 - t' | < 1 fn. (4) 

Define polygonal closed paths ?<>>•••. V. by 

y*(s) = h(^, ^j(ns + 1 - i) + - ns ) ( 5 ) 

if i — 1 < ns < i and i = 1, . . . , n. By (4) and (5), 

I y k (s) - H(S , k/n ) | <6 (fc = 0, ..., n; 0 < s < 1). (6) 

In particular, taking k = 0 and k = n, 

I To( s ) — r o (s) | < e, I y„(s) - r i(s) I < e. (7) 

By (6) and (3), 

I a — y fc (s) | > e (k = 0, . . . , n ; 0 < s ^ 1). (8) 

On the other hand, (4) and (5) also give 

| T k -i(s) — y k (s) | < e (k = 1, n; 0 < s < 1). (9) 

Now it follows from (7), (8), (9), and n + 2 applications of Lemma 10.39 
that a has the same index with respect to each of the paths r 0 , y 0 , y l9 . . . , y n , 
T j. This proves the theorem. //// 


Note : If r,(s) = H(s , t) in the preceding proof, then each r t is a closed curve , 
but not necessarily a path, since H is not assumed to be differentiable. The paths 
y k were introduced for this reason. Another (and perhaps more satisfactory) way 
to circumvent this difficulty is to extend the definition of index to closed curves. 
This is sketched in Exercise 28. 
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The Calculus of Residues 


10.41 Definition A function / is said to be meromorphic in an open set Q if 
there is a set A c= Q such that 

(a) A has no limit point in Q, 

(b) f g H(Q — A\ 

(c) / has a pole at each point of A. 

Note that the possibility A = 0 is not excluded. Thus every / e H(Q) is 
meromorphic in Q. 

Note also that (a) implies that no compact subset of Q contains infinitely 
many points of A , and that A is therefore at most countable. 

If / and A are as above, if a e A, and if 


Q(z) = X c k (z - a)~ k (1) 

k=l 

is the principal part of / at a, as defined in Theorem 10.21 (i.e., if / — Q has a 
removable singularity at a), then the number is called the residue of /at a: 


c t = Res (/; a). 

If T is a cycle and a $ T*, (1) implies 


JL_ 

2ni 


Q(z) dz = c x Ind r (a) = Res ( Q ; a) Ind r (a). 


This very special case of the following theorem will be used in its proof. 


(2) 

(3) 


10.42 The Residue Theorem Suppose f is a meromorphic function in Q. Let A 
be the set of points in Q at which f has poles. IfT is a cycle in Q — A such that 

Ind r (a) = 0 for all a £ Q, (1) 

then 

f f /(z) dz = X Res (/; a) Ind r (a). (2) 

2 nl Jr aeA 


Proof Let B = {a e A : Ind r (a) # 0}. Let W be the complement of T*. Then 
Ind r (z) is constant in each component V of W. If V is unbounded, or if V 
intersects Q c , (1) implies that Ind r (z) = 0 for every z e V. Since A has no 
limit point in Q, we conclude that B is a finite set. 

The sum in (2), though formally infinite, is therefore actually finite. 

Let a i, . . . , a n be the points of B, let Q l9 . . . , Q n be the principal parts of / 
at a l9 . . . , a n , and put g =/ — (Q x + • • • + Q n ). (If 5 = 0, a possibility which 
is not excluded, then g = /.) Put Q 0 = Q — (A — B). Since g has removable 
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singularities at a l9 . .., a„, Theorem 10.35, applied to the function g and the 
open set Q 0 » shows that 

jgto dz = 0. (3) 

Hence 

2^ | /( z ) dz = £ ^ | Qt(z) dz = Res ((?*; a*) Ind r (a k ), 

and since / and Q k have the same residue at a k , we obtain (2). //// 

We conclude this chapter with two typical applications of the residue 
theorem. The first one concerns zeros of holomorphic functions, the second is the 
evaluation of a certain integral. 

10.43 Theorem Suppose y is a closed path in a region Q, such that Ind y (a) = 0 
for every a not in Q. Suppose also that Ind y (a) = 0 or 1 for every a e Q — y* 9 
and let be the set of all a with Ind y (a) = 1. 

For any f e H(Q) let N f be the number of zeros off in counted accord- 
ing to their multiplicities. 

(a) Iff e H(Q) and f has no zeros on y* then 

N '=i,lw> d 2 - lad r m (1) 

where T = / ° y. 

(b) If also g e H(Q) and 

I f(z ) - g(z) | < | f(z) I for all z e y* (2) 


then N g = N f . 

Part (b) is usually called Rouche’s theorem. It says that two holomorphic 
functions have the same number of zeros in if they are close together on the 
boundary of as specified by (2). 


Proof Put cp = f'/f a meromorphic function in Q. If a e Q and / has a zero 
of order m = m(a) at a, then /(z) = (z — a) m h(z\ where h and l/h are holo- 
morphic in some neighborhood V of a. In V — {a}, 


cp(z) = 


m 

m 


m h'(z) 

z — a h(z) 


( 3 ) 


Thus 


Res (cp ; a) = m(a). 


(4) 
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Let A = {a e Q 1 :f (a) = 0}. If our assumptions about the index of y are 
combined with the residue theorem one obtains 


2ni 


C f'( z ) 

77 -r dz = X Res (91 a) = Z m («) = N f 

Jy J \Z) a e A ae A 


This proves one half of (1). The other half is a matter of direct computation: 


Ind r (0) = 


2ni 




n«) 

r(s) 


ds 


JL_ 

2ni 


%2n rm 

Jo f(y(s)) 


y'(s) ds 


f 

_ 2 ni J y 


m 

m 


dz. 


The parameter interval of y was here taken to be [0, 2k]. 

Next, (2) shows that g has no zero on y*. Hence (1) holds with g in place 
of f Put r 0 = g o y. Then it follows from (1), (2), and Lemma 10.39 that 


N g = Ind ro (0) = Ind r (0) = N f 


ih y 


10.44 Problem For real t,find the limit , as A — > 00 , of 


%A sin x 
x 


dx. 


( 1 ) 


Solution Since z -1 • sin z • e itz is entire, its integral over [ — A , A] equals 
that over the path r A obtained by going from — A to — 1 along the real axis, 
from — 1 to 1 along the lower half of the unit circle, and from 1 to A along 
the real axis. This follows from Cauchy’s theorem. r A avoids the origin, and 
we may therefore use the identity 

2 i sin z = e iz — e~ iz 


to see that (1) equals cp A (t + 1) — cp A (t — 1), where 


- <Pa(s) 
n 


J_ f fl! 

~ 2ni ] rA z 


dz. 


( 2 ) 


Complete r A to a closed path in two ways: First, by the semicircle from 
A to — Ai to —A; secondly, by the semicircle from A to Ai to —A. The 
function e lsz /z has a single pole, at z = 0, where its residue is 1. It follows that 


and 


- <Pa(s) 
n 


2k 


f exp (isAe i9 ) dO 


n 




( 3 ) 

( 4 ) 
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Note that 


| exp (isAe 10 ) \ = exp ( — /Is sin 0), 


( 5 ) 


and that this is < 1 and tends to 0 as A — ► oo if s and sin 6 have the same 
sign. The dominated convergence theorem shows therefore that the integral 
in (3) tends to 0 if s < 0, and the one in (4) tends to 0 if s > 0. Thus 


lim <p A (s) 


if s > 0, 
if s < 0, 


and if we apply (6) to s = t + 1 and to s = t — 1, we get 


C A sin : 
lm 

-oo J-A * 


lim 

A 


dx = 


if — 1 < t < 1, 
if It I > 1. 


Since (p A ( 0) = rc/2, the limit in (7) is rc/2 when t = ± 1. 


( 6 ) 

( 7 ) 

//// 


Note that (7) gives the Fourier transform of (sin x)/x. We leave it as an exer- 
cise to check the result against the inversion theorem. 


Exercises 

1 The following fact was tacitly used in this chapter: If A and B are disjoint subsets of the plane, if A 
is compact, and if B is closed, then there exists a <5 > 0 such that | a — ft \ > d for all a e A and /? e B. 
Prove this, with an arbitrary metric space in place of the plane. 

2 Suppose that /is an entire function, and that in every power series 

/(z)= Zc„(z-a)" 

n = 0 

at least one coefficient is 0. Prove that / is a polynomial. 

Hint : nlc„ = f {n \a). 

3 Suppose / and g are entire functions, and \f{z)\<\ g{z) \ for every z. What conclusion can you 
draw? 

4 Suppose / is an entire function, and 


\f{z)\<A + B\ z\ k 

for all z, where A, B, and k are positive numbers. Prove that / must be a polynomial. 

5 Suppose {/„} is a uniformly bounded sequence of holomo'rphic functions in ft such that {/„(z)} 
converges for every z e Q. Prove that the convergence is uniform on every compact subset of ft. 

Hint : Apply the dominated convergence theorem to the Cauchy formula for 

6 There is a region Q that exp {Q) = D{ 1; 1). Show that exp is one-to-one in ft, but that there are 
many such ft. Fix one, and define log z, for | z — 1 1 < 1, to be that w e ft for which e w = z. Prove that 
log' (z) = 1/z. Find the coefficients a n in 


1 

z 


oo 


E a n( Z ~ !)" 
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and hence find the coefficients c n in the expansion 


log z = X c n {z - 1)". 

n = 0 


In what other discs can this be done? 

7 Iff e H(Q), the Cauchy formula for the derivatives of f 




=-[ 
2 ni J r 


m 


(C-r )" +1 


df (n=l, 2,3,...) 


is valid under certain conditions on z and V. State these, and prove the formula. 

8 Suppose P and Q are polynomials, the degree of Q exceeds that of P by at least 2, and the rational 
function R = P/Q has no pole on the real axis. Prove that the integral of R over ( — oo, oo) is 2ni times 
the sum of the residues of R in the upper half plane. [Replace the integral over ( — A, A) by one over a 
suitable semicircle, and apply the residue theorem.] What is the analogous statement for the lower 
half plane? Use this method to compute 



1 + x 4 


dx. 


9 Compute e itx /{\ + x 2 ) dx for real t, by the method described in Exercise 8. Check your answer 
against the inversion theorem for Fourier transforms. 

10 Let y be the positively oriented unit circle, and compute 



11 Suppose a is a complex number, | a | # 1, and compute 



dO 

1 — 2a cos 0 + a 2 


by integrating (z — a) l {z — 1/a) 1 over the unit circle. 
12 Compute 


13 Compute 



(for real t). 


2, 3, 4,...). 


[For even n, the method of Exercise 8 can be used. However, a different path can be chosen, which 
simplifies the computation and which also works for odd n : from 0 to R to R exp (2ni/n) to 0.] 

Answer : ( n/n)/sin {n/n). 

14 Suppose and Q 2 are plane regions,/ and g are nonconstant complex functions defined in 
and Q 2 > respectively, and /((IJcflj, Put h = g ° /. If / and g are holomorphic, we know that h is 
holomorphic. Suppose we know that / and h are holomorphic. Can we conclude anything about gl 
What if we know that g and h are holomorphic? 

15 Suppose Q is a region, (p e H(Q), <p' has no zero in Q, / e H((p(Q)), g =f o q>, z 0 e Q, and w 0 = 
<p(z 0 ). Prove that if / has a zero of order m at w 0 , then g also has a zero of order m atz 0 . How is this 
modified if (p ' has a zero of order k at z 0 ? 
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16 Suppose fi is a complex measure on a measure space X , £2 is an open set in the plane, (p is a 
bounded function onQxX such that (p{z , t) is a measurable function of t , for each z e £2, and (p{z , t) 
is holomorphic in £2, for each t e X. Define 

/(z) = J <p(z, t ) dfl(t) 

for zed. Prove that / e H(d). Hint : Show that to every compact K c d there corresponds a constant 
M < co such that 


(p{z, t) — (p(z 0 , t) 

z z 0 


< M 


( z and z 0 e K, t e X). 


17 Determine the regions in which the following functions are defined and holomorphic: 

m- 


f 1 dt f 00 e' z f 1 e tz 

: 1 , g(z) = dt , h(z) = dt. 

Jo 1 +tz Jo 1 + t 2 J-l 1 +t 2 


Hint : Either use Exercise 16, or combine Morera’s theorem with Fubini’s. 

18 Suppose / e H{d), D(a ; r) a d,y is the positively oriented circle with center at a and radius r, and / 
has no zero on y*. For p = 0, the integral 



m 

m 


z p dz 


is equal to the number of zeros of /in D{a ; r). What is the value of this integral (in terms of the zeros 
of/) for p = 1, 2, 3, ... ? What is the answer if z p is replaced by any (p e H(Q)? 

19 Suppose / e H(U), g e H(U), and neither / nor g has a zero in U. If 



(w = 1, 2, 3, ...) 


find another simple relation between / and g. 

20 Suppose D is a region,/, e H(d) for n = 1, 2 , 3 , ... , none of the functions /„ has a zero in £2, and 
{/„} converges to / uniformly on compact subsets of d. Prove that either / has no zero in £2 or/(z) = 0 
for all z e £2. 

If £2' is a region that contains every /„(£ 2), and if /is not constant, prove that /(£ 2) <= £2'. 

21 Suppose / e H(d), £2 contains the closed unit disc, and |/(z)| < 1 if |z| = 1. How many fixed 
points must / have in the disc? That is, how many solutions does the equation /(z) = z have there? 

22 Suppose / e H(d), £2 contains the closed unit disc, | / (z) | > 2 if | z | = 1 , and /( 0) = 1. Must / have a 
zero in the unit disc? 

23 Suppose P„(z) = 1 + z/1 ! + • • • + z n /n\, Q n (z ) = P„(z) — 1, where n = 1, 2 , 3 , — What can you say 
about the location of the zeros of P„ and Q n for large n? Be as specific as you can. 

24 Prove the following general form of Rouche’s theorem : Let £2 be the interior of a compact set K in 
the plane. Suppose / and g are continuous on K and holomorphic in £2, and |/(z) — g(z)\ < |/(z)| for 
all z e K — £2. Then / and g have the same number of zeros in £2. 

25 Let A be the annulus {z: r t < \ z \ < r 2 }, where r x and r 2 are given positive numbers. 

(a) Show that the Cauchy formula 


m = 


2ni 



no 


dt 


is valid under the following conditions : / e H(A), 


r t + € < | z | < r 2 — €, 
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and 


7i(0 = {'i + €)e it i y 2 (t) = (r 2 - €)e u (0 < t <, 2n). 

(b) Show by means of (a) that every / e H{A) can be decomposed into a sum / =f x +/ 2 , where 

is holomorphic outside D{ 0; rj) and f 2 e H(D( 0; r 2 )); the decomposition is unique if we require that 
fi(z)— >0 as |z|— ► oo. 

(c) Use this decomposition to associate with each / e H(A) its so-called “ Laurent series” 

oo 

Z C„Z" 

- oo 

which converges to / in A. Show that there is only one such series for each /. Show that it converges to 
/ uniformly on compact subsets of A. 

(i d) Iff e H(A) and /is bounded in A, show that the components f x and f 2 are also bounded. 

( e ) How much of the foregoing can you extend to the case r t = 0(or r 2 = oo,or both)? 

(/) How much of the foregoing can you extend to regions bounded by finitely many (more than 
two) circles? 

26 It is required to expand the function 


1 1 
1 — z 2 3 — z 


in a series of the form £ c n z". 

- 00 

How many such expansions are there? In which region is each of them valid? Find the coeffi- 
cients c n explicitly for each of these expansions. 

27 Suppose Q is a horizontal strip, determined by the inequalities a < y < b, say. Suppose / e H(Q), 
and /(z) =/(z + 1) for all z e Q. Prove that / has a Fourier expansion in £2, 

/(*) = f c„e 2 * in! , 

- oo 

which converges uniformly in {z: a -I- € < y < b — c}, for every € > 0. Hint : The map z— ► e 2ltiz con- 
verts / to a function in an annulus. 

Find the integral formulas by means of which the coefficients c n can be computed from / 

28 Suppose T is a closed curve in the plane, with parameter interval [0, In']. Take a £ T*. Approx- 
imate T uniformly by trigonometric polynomials . Show that Ind F|i (a) = Ind Fm (a) if m and n are 
sufficiently large. Define this common value to be Ind F (a). Prove that the result does not depend on 
the choice of {T n }; prove that Lemma 10.39 is now true for closed curves, and use this to give a 
different proof of Theorem 10.40. 

29 Define 



dd 

re ie + z ‘ 


Show that /(z) = z if | z | < 1 and that /(z) = 1/z if | z | > 1. 

Thus / is not holomorphic in the unit disc, although the integrand is a holomorphic function of 
z. Note the contrast between this, on the one hand, and Theorem 10.7 and Exercise 16 on the other. 

Suggestion : Compute the inner integral separately for r < \ z \ and for r > \ z \ . 

30 Let £2 be the plane minus two points, and show that some closed paths T in £2 satisfy assumption 
(1) of Theorem 10.35 without being null-homotopic in £2. 



CHAPTER 

ELEVEN 

HARMONIC FUNCTIONS 


The Cauchy-Riemann Equations 

11.1 The Operators d and d Suppose / is a complex function defined in a plane 
open set Q. Regard / as a transformation which maps Q into R 2 9 and assume that 
/ has a differential at some point z 0 e Q, in the sense of Definition 7.22. For 
simplicity, suppose z 0 = /(z 0 ) = 0. Our differentiability assumption is then equiv- 
alent to the existence of two complex numbers a and /? (the partial derivatives of / 
with respect to x and y at z 0 = 0) such that 

/(z) = ax + fiy + rj(z)z (z = x + iy\ (1) 

where rj(z)^> 0 as z— > 0. 

Since 2x = z + z and 2 iy = z — z, (1) can be rewritten in the form 

f(z) = - ^ z + a z + T]{z)z. (2) 

This suggests the introduction of the differential operators 


a 1 ( d . d\ - \( d . d\ 

~ 2\dx 1 dy} ~2\dx + l dyf 


Now (2) becomes 


— = m o) + mm ■ - + n(z) (z / o). 

z z 


For real z, z/z — 1; for pure imaginary z, z/z = — 1. Hence /(z)/z has a limit 
at 0 if and only if (df)( 0) = 0, and we obtain the following characterization of 
holomorphic functions : 


231 
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11.2 Theorem Suppose f is a complex function in Q that has a differential at 
every point o/Q. Then f e H(Q) if and only if the Cauchy- Riemann equation 

m(z) = o (i) 

holds for every z e Q. In that case we have 

m = (df)(z) (z e Q). (2) 


Iff = u + iv , u and v real, (1) splits into the pair of equations 

u x = v y , u y =-v x 

where the subscripts refer to partial differentiation with respect to the indicated 
variable. These are the Cauchy-Riemann equations which must be satisfied by the 
real and imaginary parts of a holomorphic function. 

11.3 The Laplacian Let / be a complex function in a plane open set Q, such that 
f xx and f yy exist at every point of Q. The Laplacian of /is then defined to be 

A /■=/«+/„. (1) 


If /is continuous in Q and if 


A/=0 (2) 

at every point of Q, then / is said to be harmonic in Q. 

Since the Laplacian of a real function is real (if it exists), it is clear that a 
complex function is harmonic in Q if and only if both its real part and its imaginary 
part are harmonic in Q. 

Note that 

A f=4ddf (3) 

provided that f xy = f yx , and that this happens for all / which have continuous 
second-order derivatives. 

If/ is holomorphic, then df = 0,/has continuous derivatives of all orders, and 
therefore (3) shows : 

11.4 Theorem Holomorphic functions are harmonic. 

We shall now turn our attention to an integral representation of harmonic 
functions which is closely related to the Cauchy formula for holomorphic func- 
tions. It will show, among other things, that every real harmonic function is 
locally the real part of a holomorphic function, and it will yield information 
about the boundary behavior of certain classes of holomorphic functions in open 
discs. 



The Poisson Integral 

11.5 The Poisson Kernel This is the function 
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P r (t ) = X (0 < r < 1, t real). (1) 

— 00 

We may regard P r (t) as a function of two variables r and t or as a family of 
functions of r, indexed by r. 

If z = re 10 (0 < r < 1, 9 real), a simple calculation, made in Sec. 5.24, shows 

that 


P r (0 - t) = Re 



+ z 
— z 


1-r 2 

1—2 r cos (9 — t) + r 2 ' 


( 2 ) 


From (1) we see that 


if." 


.(f) dt = 1 (0 < r < 1). 

From (2) it follows that P r (t) > 0, P r (t) = P r ( — t ), that 

P r (t)<P r (6) (0 < <5 < | f | < 71 ), 

and that 

lim P r (S) = 0 (0 < d < n). 

r-> 1 


( 3 ) 

( 4 ) 

( 5 ) 


These properties are reminiscent of the trigonometric polynomials Q k (t) that 
were discussed in Sec. 4.24. 

The open unit disc D(0; 1) will from now on be denoted by U. The unit circle 
- the boundary of U in the complex plane - will be denoted by T. Whenever it is 
convenient to do so, we shall identify the spaces L P (T) and C(T) with the corre- 
sponding spaces of 27c-periodic functions on R l , as in Sec. 4.23. 

One can also regard P r (9 — t) as a function of z = re 10 and e ix . Then (2) 
becomes 


for ze (/, e i% e T. 


P(z, e ix ) = 


- M 

\e»-z\ 


( 6 ) 


11.6 The Poisson Integral If f e I}(T) and 

P(re ie ) = ^ I' Pr(8 - dt, (1) 

then the function F so defined in U is called the Poisson integral of / We shall 
sometimes abbreviate the relation (1) to 

f = pin 


(2) 
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If/is real, formula 11.5(2) shows that P[/] is the real part of 


_ 1 _ 

2n 


e lt + z 


-n e 


fit) dt. 


( 3 ) 


which is a holomorphic function of z = re 10 in U, by Theorem 10.7. Hence P[/] 
is harmonic in U. Since linear combinations (with constant coefficients) of harmo- 
nic functions are harmonic, we see that the following is true : 


11.7 Theorem If f e I?(T) then the Poisson integral P[/] is a harmonic func- 
tion in U. 


The following theorem shows that Poisson integrals of continuous functions 
behave particularly well near the boundary of U. 


11.8 Theorem Iff e C(T) and if Hf is defined on the closed unit disc U by 


(Hf)(re ie ) = 


[ f(e ie ) if r = 1, 

\Pin(re i9 ) if 0 < r < 1, 


(1) 


then Hfe C(U). 


Proof Since P r (t) > 0, formula 1 1.5(3) shows, for every g e C(T), that 

\Plg~]{re i0 )\<\\g\\ T (0 < r < 1), (2) 

so that 

\\Hg\\ v = \\g\\ T igzC{T)). (3) 

(As in Sec. 5.22, we use the notation \\g\\ E to denote the supremum of \g \ on 
the set E .) If 

g(e ie )= £ (4) 

n = —N 

is any trigonometric polynomial, it follows from 11.5(1) that 

(Hg)(re w ) = £ c n r^e M , (5) 

n= ~N 

so that Hg e C(U). 

Finally, there are trigonometric polynomials g k such that \\g k — f\\ T ~* 0 
as /c— ► oo. (See Sec. 4.24.) By (3), it follows that 

\\Hg k -Hf\\ v =\\H(g k -f)\\ v ^0 (6) 

as k > oo. This says that the functions Hg k e C(U) converge, uniformly on U, 
to Hf. Hence Hf e C(U). //// 
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Note : This theorem provides the solution of a boundary value problem (the 
Dirichlet problem) : A continuous function / is given on T and it is required to find 
a harmonic function F in U “ whose boundary values are /.” The theorem exhibits 
a solution, by means of the Poisson integral of f and it states the relation 
between / and F more precisely. The uniqueness theorem which corresponds to 
this existence theorem is contained in the following result. 


11.9 Theorem Suppose u is a continuous real function on the closed unit disc U, 
and suppose u is harmonic in U. Then (in U) u is the Poisson integral of its 
restriction to T, and u is the real part of the holomorphic function 



'* e il + z 


u(e lt ) dt 


(z e U). 


(1) 


Proof Theorem 10.7 shows that / e H(U). If u Y = Re f then (1) shows that 
u i is the Poisson integral of the boundary values of w, and the theorem will 
be proved as soon as we show that u = u Y . 

Put h = u — u t . Then h is continuous on 0 (apply Theorem 11.8 to u t ), h 
is harmonic in 17, and h = 0 at all points of T. Assume (this will lead to a 
contradiction) that h(z 0 ) > 0 for some z 0 e U. Fix e so that 0 < e < /i(z 0 ), and 
define 


g(z) = h(z) + e}z\ 2 (z e U). (2) 

Then g(z 0 ) > h(z 0 ) > e. Since g e C(U) and since g = e at all points of T, 
there exists a point z l e U at which g has a local maximum. This implies that 
g xx < 0 and g yy < 0 at z v But (2) shows that the Laplacian of g is 4e > 0, and 
we have a contradiction. 

Thus u — u x <0. The same argument shows that u Y — u < 0. Hence u = 
u l9 and the proof is complete. //// 


11.10 So far we have considered only the unit disc U = D( 0; 1). It is clear that 
the preceding work can be carried over to arbitrary circular discs, by a simple 
change of variables. Hence we shall merely summarize some of the results: 

If u is a continuous real function on the boundary of the disc D(a ; R) and if u 
is defined in D(a ; R) by the Poisson integral 


u(a + re w ) = 


_ 1 _ 
2 n 


m " R 2 - r 2 

R 2 — 2 Rr cos (6 — t) - J- r 2 


u(a + Re lt ) dt 


(1) 


then u is continuous on D(a ; R) and harmonic in D(a ; R). 

If u is harmonic (and real) in an open set Q and if D(a ; R) a Q, then u 
satisfies (1) in D(a ; R) and there is a holomorphic function / defined in D(a ; R) 
whose real part is u. This / is uniquely defined, up to a pure imaginary additive 
constant. For if two functions, holomorphic in the same region, have the same 
real part, their difference must be constant (a corollary of the open mapping 
theorem, or the Cauchy-Riemann equations). 
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We may summarize this by saying that every real harmonic function is locally 
the real part of a holomorphic function. 

Consequently, every harmonic function has continuous partial derivatives of 
all orders. 

The Poisson integral also yields information about sequences of harmonic 
functions : 

11.11 Harnack’s Theorem Let {u n } be a sequence of harmonic functions in a 

region Q. 


(a) If — ► w uniformly on compact subsets of Q, then u is harmonic in Q. 

(b) If Wj < u 2 < u 3 < • • • , then either {u n } converges uniformly on compact 
subsets of Q, or u n (z) —> oo for every zeQ. 


Proof To prove (a), assume D(a ; R) a Q, and replace u by u n in the Poisson 
integral 11.10(1). Since ► u uniformly on the boundary of D(a; R\ we con- 
clude that u itself satisfies 11.10(1) in D(a ; R). 

In the proof of (b\ we may assume that u x > 0. (If not, replace u n by 
u n — u i .) Put u = sup u n , let A — {z e Q: u(z) < oo}, and B — Q — A. Choose 
D{a ; R) c z Q. The Poisson kernel satisfies the inequalities 


R — r R 2 — r 2 R + r 

R + r ~ R 2 — 2rR cos (6 — t) A r ~ R — r 


for 0 < r < R. Hence 


R - r 
R + r 


u n (a) < u„(a + re ie ) < 


R + r 
R - r 


u„(a). 


The same inequalities hold with u in place of u n . It follows that either 
u(z) = oo for all z e D(a ; R) or u(z) < oo for all z e D(a ; R). 

Thus both A and B are open; and since Q is connected, we have either 
A = 0 ( in which case there is nothing to prove) or A = Q. In the latter case, 
the monotone convergence theorem shows that the Poisson formula holds 
for u in every disc in Q. Hence u is harmonic in Q. Whenever a sequence of 
continuous functions converges monotonically to a continuous limit, the con- 
vergence is uniform on compact sets ([26], Theorem 7.13). This completes the 
proof. //// 
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The Mean Value Property 

11.12 Definition We say that a continuous function u in an open set Q has 
the mean value property if to every z g Q there corresponds a sequence { r n } 
such that r n > 0, r n -> 0 as n-> oo, and 

M(z) = J* u(z + r„ e“) dt (n = 1, 2, 3, . . .). (1) 

In other words, u(z) is to be equal to the mean value of u on the circles of 
radius r n and with center at z. 

Note that the Poisson formula shows that (1) holds for every harmonic 
function w, and for every r such that D(z ; r) c= Q. Thus harmonic functions 
satisfy a much stronger mean value property than the one that we just 
defined. The following theorem may therefore come as a surprise : 

11.13 Theorem If a continuous function u has the mean value property in an 
open set Q, then u is harmonic in Q. 

Proof It is enough to prove this for real u. Fix D(a ; R) c= Q. The Poisson 
integral gives us a continuous function h on D(a ; R) which is harmonic in 
D(a; R) and which coincides with u on the boundary of D(a; R). Put 
v = u — h 9 and let m = sup {v(z): z g D(a ; R)}. Assume m > 0, and let E be 
the set of all z g D(a ; R) at which v(z) — m. Since v = 0 on the boundary of 
D(a ; R\ E is a compact subset of D(a ; R). Hence there exists a z 0 e E such 
that 


| z 0 - a | > | z - a | 

for all z g E. For all small enough r, at least half the circle with center z 0 and 
radius r lies outside £, so that the corresponding mean values of v are all less 
than m = v(z 0 ). But v has the mean value property, and we have a contradic- 
tion. Thus m = 0, so v < 0. The same reasoning applies to — v. Hence v = 0, 
or u = h in D(a ; R ), and since D(a ; R) was an arbitrary closed disc in Q, u is 
harmonic in Q. //// 

Theorem 11.13 leads to a reflection theorem for holomorphic functions. By 
the upper half plane n + we mean the set of all z = x + iy with y > 0; the lower 
half plane n “ consists of all z whose imaginary part is negative. 

11.14 Theorem (The Schwarz reflection principle) Suppose L is a segment of 
the real axis , Q + is a region in n + , and every t g Lis the center of an open disc 
D t such that Tl + n D t lies in Q + . Let Q ~ be the reflection of Q + : 

Q" = {z: z g Q + }. 


(1) 
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Suppose f = u + iv is holomorphic in Q + , and 

lim v(z n ) = 0 (2) 

n-+ oo 

for every sequence {z n } in Q + which converges to a point of L. 

Then there is a function F, holomorphic in Q + u L u Q~, such that 
F(z) = / (z) in Q + ; this F satisfies the relation 

F(z) = F(z) (zeQ+uLuQ"). (3) 

The theorem asserts that / can be extended to a function which is holo- 
morphic in a region symmetric with respect to the real axis, and (3) states that F 
preserves this symmetry. Note that the continuity hypothesis (2) is merely 
imposed on the imaginary part of / 

Proof Put Q = Q + uLuQ~. We extend v to Q by defining i;(z) — 0 for 
z e L and v(z) = —v(z) for z e Q~. It is then immediate that v is continuous 
and that v has the mean value property in Q, so that v is harmonic in Q, by 
Theorem 11.13. 

Hence v is locally the imaginary part of a holomorphic function. This 
means that to each of the discs D t there corresponds an f e H(D t ) such that 
Im f = v. Each f is determined by v up to a real additive constant. If this 
constant is chosen so that / f (z) =/(z) for some z e D t n II + , the same will 
hold for all z e D t n II + , since f — fi is constant in the region D t n II + . We 
assume that the functions f are so adjusted. 

The power series expansion of f in powers of z — t has only real coeffi- 
cients, since v = 0 on L, so that all derivatives of f are real at t. It follows 
that 


/,(z) = f,(z) (z 6 D,). 


( 4 ) 


Next, assume that D s n D, # 0. Then f, = / =f s in D, n D s n II + ; and 
since D, n D s is connected, Theorem 10.18 shows that 


/r(z) =/ s (z) (z e D, n D s ). 


( 5 ) 


Thus it is consistent to define 


F(z) = 


f(z) 

m 

m 


for zgQ + 
for z e D t 
for z e Q“ 


( 6 ) 


and it remains to show that F is holomorphic in Q .If D(a ; r) c= Q , then 
D(a; r) c= Q + , so for every z e D(a ; r) we have 


/(*) = 


00 


Z c n(z - a)". 


( 7 ) 
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Hence 


F(z) = £ c„(z - a) n (z e D{a ; r)). (8) 

n = 0 

This completes the proof. //// 

Boundary Behavior of Poisson Integrals 

11.15 Our next objective is to find analogues of Theorem 11.8 for Poisson inte- 
grals of Zf-functions and measures on T. 

Let us associate to any function u in U a family of functions u r on T, defined 
by 

u r (e id ) = u(re id ) (0 < r < 1). (1) 

Thus u r is essentially the restriction of u to the circle with radius r, center 0, but 
we shift the domain of u r to T. 

Using this terminology, Theorem 11.8 can be stated in the following form: 

If f e C(T) and F = P[/], then F r -+f uniformly on T, as r — > 1. In other 
words, 

lim \\F r “/|| a, = 0, (2) 

r-l 

which implies of course that 

lim F r (e‘°)=f{e i9 ) (3) 


at every point of T. 

As regards (2), we shall now see (Theorem 11.16) that the corresponding 
norm-convergence result is just as easy in II . Instead of confining ourselves to 
investigating radial limits , as in (3), we shall then study nontangential limits of 
Poisson integrals of measures and II- functions; the differentiation theory devel- 
oped in Chap. 7 will play an essential role in that study. 

11.16 Theorem 7/1 < p < oo ,/ e L f(T), and u = P[/], then 

\\u r \\ p <\\f\\ p (0 < r < 1). (1) 

If p < oo, then 

lim||« r -/|| p = 0. (2) 

r~* 1 

Proof If we apply Jensen’s inequality (or Holder’s) to 

U r ( eW ) = ^ f(t)Pr(0 - t) dt (3) 
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we obtain 


I U r (e ie ) r < Y n I* I W \ pp ' (e - dt - ( 4 ) 

If we integrate (4) with respect to 0 , over [ — n, 7r] and use Fubini’s theorem, 
we obtain (1). 

Note that formula 11.5(3) was used twice in this argument. 

To prove (2), choose e > 0, and choose g e C(T) so that || g — f\\ p < e 
(Theorem 3.14). Let v = P\_g~\ . Then 


«r -/ = («, - V r) + (V r ~ d) + iff “/)• (5) 

By (1), || u r - iv||, = ||(w - ©X.II, < ||/- g\\ p < e. Thus 

I|m. —/lip <2e + ||o r — g\\ p (6) 

for all r < 1. Also, \\v r — g\\ p < \\v r — gW^, and the latter converges to 0 as 
r^> 1, by Theorem 11.8. This proves (2). //// 


11.17 Poisson Integrals of Measures If \i is a complex measure on T, and if we 
want to replace integrals over T by integrals over intervals of length 2n in R 1 , 
these intervals have to be taken half open, because of the possible presence of 
point masses in g. To avoid this (admittedly very minor) problem, we shall keep 
integration on the circle in what follows, and will write the Poisson integral 
u = P[dg\ of g in the form 


u(z) = 


P(z , e lt ) dg(e lt ) 


(zeU) 


(1) 


where P(z, e lt ) = (1 — | z \ 2 )/ \e lt — z | 2 , as in formula 11.5(6). 

The reasoning that led to Theorem 11.7 applies without change to Poisson 
integrals of measures. Thus w, defined by (1), is harmonic in U. 

Setting \\g\\ = \ \i \ (T), the analogue of the first half of Theorem 11.16 is 




\u(re i0 )\dO<M. 


( 2 ) 


To see this, replace \i by \g \ in (1), apply Fubini’s theorem, and refer to formula 
11.5(3). 


11.18 Approach Regions For 0 < a < 1, we define Q a to be the union of the disc 
D(0; a) and the line segments from z = 1 to points of D(0; a). 

In other words, Q a is the smallest convex open set that contains D(0; a) and 
has the point 1 in its boundary. Near z = 1, Q a is an angle, bisected by the radius 
of U that terminates at 1, of opening 20, where a = sin 0. Curves that approach 1 
within Q a cannot be tangent to T. Therefore Q a is called a nontangential approach 
region, with vertex 1. 
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The regions Q a expand when a increases. Their union is U, their intersection 
is the radius [0, 1). 

Rotated copies of Q a , with vertex at e i% , will be denoted by . 

11.19 Maximal Functions If 0 < a < 1 and u is any complex function with 
domain U, its nontangential maximal function N a u is defined on T by 

(N a u)(e“) = sup { | u(z) | : 2 e e“Cl x }. (1) 

Similarly, the radial maximal function of u is 

(M rad u)(e i( ) = sup { | uire*) | : 0 < r < 1}. (2) 

If u is continuous and 2 is a positive number, then the set where either of 
these maximal functions is <2 is a closed subset of T. Consequently, N a u and 
M rad u are lower semicontinuous on T ; in particular, they are measurable. 

Clearly, M rad w < N a u , and the latter increases with a. If u = P\_dp ], Theorem 

11.20 will show that the size of N a u is, in turn, controlled by the maximal func- 
tion Mp that was defined in Sec. 7.2 (taking k = 1). However, it will simplify the 
notation if we replace ordinary Lebesgue measure m on T by o = m/2n. Then o is 
a rotation-invariant positive Borel measure on T, so normalized that o(T) = 1. 

Accordingly, M\i is now defined by 

(AW») = sup-^p. (3) 

The supremum is taken over all open arcs / c= T whose centers are at e l °, includ- 
ing T itself (even though T is of course not an arc). 

Similarly, the derivative Dfi of a measure \i on T is now 

m(e ie ) = (4) 

0 ( 1 ) 

as the open arcs / c= T shrink to their center e l9 9 and e l9 is a Lebesgue point of 
/ e I}(T) if 

lim ^\f—f(e ie )\ do = °, (5) 

where {/} is as in (4). 

If dp =f do + dp s is the Lebesgue decomposition of a complex Borel 
measure p on T, where / e Lf(T) and p s _L a, Theorems 7.4, 7.7, and 7.14 assert 
that 

a{Mii >*}<\ Ml, (6) 

that almost every point of T is a Lebesgue point of f and that Dp = f Dp s = 0 
a.e. [a]. 
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We will now see, for any complex Borel measure p on 7, that the non- 
tangential and radial maximal functions of the harmonic function P[d/i] are con- 
trolled by M/i. In fact, if any one of them is finite at some point of 7, so are the 
others; this can be seen by combining Theorem 11.20 with Exercise 19. 

11.20 Theorem Assume 0 < a < 1. Then there is a constant c a > 0 with the 
following property: If p is a positive finite Borel measure on T and u = P\_dp\ 
is its Poisson integral , then the inequalities 

c x (N x u)(e w ) < (M rad u)(e w ) < (AW*) (1) 

hold at every point e ld e 7. 

Proof We shall prove (1) for 0 = 0. The general case follows then if the 
special case is applied to the rotated measure p e (E) = p(e ld E). 

Since u(z) = J r P(z, e lt ) dp(e lt \ the first inequality in (1) will follow if we 
can show that 


c a P(z,^)<P(|z|,^) (2) 

holds for all z e Q a and all e ix e 7. By formula 11.5(6), (2) is the same as 

c a \e u — r | 2 < \e ix — z\ 2 (3) 

where r = \ z\. 

The definition of Q a shows that | z — r | /(I — r) is bounded in Q a , say by 
y a . Hence 

| e lt — r | < | e ix — z | + | z — r \ 

< | — z | + y a (l — r) < (1 -h y a ) | e ix — z \ 

so that (3) holds with c a = (1 -h y a ) -2 . This proves the first half of (1). 

For the second half, we have to prove that 

P r (t) dp(e ix ) < (Mp)( 1) (0 < r < 1). (4) 

Jr 

Fix r. Choose open arcs Ij a 7, centered at 1, so that J x c= J 2 c= • • c= I n - l9 
put /„ = 7. For 1 <j < n, let Xj be the characteristic function of Ij , and let hj 
be the largest positive number for which h } < P r on 7. Define 

K = i(hj-h j+l ) X j (5) 

j= i 

where h n + l =0. Since P r (t) is an even function of t that decreases as t 
increases from 0 to 7c, we see that hj — h j+l >0, that K = hj on Ij — I j~ i 
(putting J 0 = 0), and that K < P r . The definition of Mp shows that 

p(Ij) < (Mp)(l)o(Ij). 


( 6 ) 
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Hence, setting (Mp)( 1) = M, 


K dn = £ (hj — h j+ MIj) <M I (/!, - h J+ MIj ) 



Finally, if we choose the arcs Ij so that their endpoints form a sufficiently fine 
partition of T, we obtain step functions K that converge to P r , uniformly on 
T. Hence (4) follows from (7). //// 


11.21 Nontangential Limits A function F, defined in (/, is said to have non - 
tangential limit X at e ld e T if, for each a < 1, 

lim F(Zj) = A 

J-+ oo 

for every sequence {z^} that converges to and that lies in e ie Q a . 

11.22 Theorem If g is a positive Borel measure on T and ( Dp)(e ld ) = 0 for some 
9 , then its Poisson integral u = P[_dp] has nontangential limit 0 at e ld . 

Proof By definition, the assumption (Dp)(e id ) = 0 means that 

lim p(I)/o(I) = 0 (1) 

as the open arcs / c= T shrink to their center e 10 . Pick e > 0. One of these 
arcs, say I 0 , is then small enough to ensure that 

KI) < «*/) ( 2 ) 

for every / c= I 0 that has e ie as center. 

Let p 0 be the restriction of \i to / 0 , put — p — p 0 , and let be the 
Poisson integral of p t (i = 0, 1). Suppose Zj converges to e ie within some 
region e i0 Q a . Then Zj stays at a positive distance from T — 1 0 . The inte- 
grands in 


ufzj) = [ P(zj , e il ) dp(e lt ) 

JT-Io 

converge therefore to 0 as 7 — > 00, uniformly on T — I 0 . Hence 

lim w 1 (z J ) = 0 . 

j-+ °o 

Next, use (2) together with Theorem 11.20 to see that 


(3) 

(4) 


c a (N a u 0 )(e*) < (Mu oXO < €. 


(5) 
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In e ie Q x , u 0 (z) < (N a u 0 )(e w ). Hence (5) implies that 

lim sup u 0 (zj) < e/c a . (6) 

j~* 00 

Since u = u 0 + and 6 was arbitrary, (4) and (6) give 

lim u(zj) = 0. (7) 

7-oo 

//// 

11.23 Theorem J//e L^T), r/ien P[/] has nontangential limit f(e ld ) at every 
Lebesgue point e ld of f. 

Proof Suppose e ie is a Lebesgue point of/. By subtracting a constant from / 
we may assume, without loss of generality, that f{e ie ) = 0. Then 

um ^Ii / i ,to - 0 (i> 

as the open arcs I a T shrink to their center e ie . Define a Borel measure p on 
Tby 


p(E) = ^\f\do (2) 

Then (1) says that ( Dp\e ie ) = 0; hence P\_dp\ has nontangential limit 0 at e ld , 
by Theorem 11.22. The same is true of P[/], because 

ip[/]i<m/i] = pw (3) 

mi 


The last two theorems can be combined as follows. 


11.24 Theorem If dp = / da + dp s is the Lebesgue decomposition of a complex 
Borel measure p on T, where f e l}(T ), p s _L a, t/ien P[d/x] has nontangential 
limit f (e 10 ) at almost all points of T. 

Proof Apply Theorem 11.22 to the positive and negative variations of the 
real and imaginary parts of p s9 and apply Theorem 11.23 to /. //// 

Here is another consequence of Theorem 11.20. 

11.25 Theorem For 0 < a < 1 and 1 < p < oo, there are constants 
.4(a, p) < oo with the following properties: 
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(a) If p is a complex Borel measure on T, and u = P[dp\ then 

a{N a u > X] < ^ \\p\\ (0 < X < oo). 

(b) Ifl<p< oo ,/e Zf(T), and u = P[/], then 

\\N a u\\ p <A(oc, p)\\f\\ p . 

Proof Combine Theorem 11.20 with Theorem 7.4 and the inequality (7) in 
the proof of Theorem 8.18. //// 

The nontangential maximal functions N^u are thus in weak L 1 if u = P\dp ], 
and they are in B(T) if u = P[/] for some / e Zf(T), p > 1. This latter result may 
be regarded as a strengthened form of the first part of Theorem 11.16. 

Representation Theorems 

11.26 How can one tell whether a harmonic function u in U is a Poisson integral 
or not? The preceding theorems (11.16 to 11.25) contain a number of necessary 
conditions. It turns out that the simplest of these, the If-boundedness of the 
family {u r : 0 < r < 1} is also sufficient! Thus, in particular, the boundedness of 
\\u r \\ u as r— > 1, implies the existence of nontangential limits a.e. on T, since, as we 
will see in Theorem 11.30, u can then be represented as the Poisson integral of a 
measure. 

This measure will be obtained as a so-called “ weak limit ” of the functions u r . 
Weak convergence is an important topic in functional analysis. We will approach 
it through another important concept, called equicontinuity, which we will meet 
again later, in connection with the so-called “ normal families ” of holomorphic 
functions. 

11.27 Definition Let 3F be a collection of complex functions on a metric space X 
with metric p. 

We say that 3F is equicontinuous if to every e > 0 corresponds a S > 0 such 
that \f(x)— f(y)\<e for every / e and for all pairs of points x, y with 
p(x, y) < S . (In particular, every / e & is then uniformly continuous.) 

We say that 3F is pointwise bounded if to every x e X corresponds an 
M(x) < oo such that | f(x) | < M(x) for every / e 3F. 

11.28 Theorem (Arzela-Ascoli) Suppose that 3? is a pointwise bounded equi- 
continuous collection of complex functions on a metric space X , and that X contains 
a countable dense subset E. 

Every sequence {/„} in 3F has then a subsequence that converges uniformly on 
every compact subset of X. 
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Proof Let x 1} x 2 , x 3 , . . . be an enumeration of the points of E. Let S 0 be the 
set of all positive integers. Suppose k > 1 and an infinite set c= S 0 has 
been chosen. Since {f n (x k ): n g S k ^ t } is a bounded sequence of complex 
numbers, it has a convergent subsequence. In other words, there is an infinite 
set S k a S k -! so that lim f n (x k ) exists as n—* oo within S k . 

Continuing in this way, we obtain infinite sets S 0 z> S t z> S 2 * * * with 
the property that lim f n (xj) exists, for 1 <j < k, if n-* oo within S k . 

Let r k be the kth term of S k (with respect to the natural order of the 
positive integers) and put 


s = {ri, r 2 , r 3 , 

For each k there are then at most k — 1 terms of S that are not in S k . 

Hence lim f n (x) exists, for every x g E, as n-+ oo within S. 

(The construction of S from {S k } is the so-called diagonal process .) 

Now let K a X be compact, pick e > 0. By equicontinuity, there is a 
S > 0 so that p(p, q) < S implies | f n (p) — f n (q) \ < e, for all n. Cover K with 
open balls B u ..., B M of radius S/2. Since E is dense in X there are points 
Pi e Bi n E for 1 < i < M. Since p t e E, lim /„(p f ) exists, as n— ► oo within S. 
Hence there is an integer N such that 

\fm(Pi)-fn(Pi)\ <e 

for i = 1, . . . , M, if m > N, n > N, and m and n are in S. 

To finish, pick x e K. Then x e B t for some i, and p(x, p t ) < S. Our 
choice of S and N shows that 

I fm(x) —/„(*) | < I fjx) -f m {Pi) \ + | f m {Pi) -f„(Pi)\ + \f„(Pi) ~fn(x)\ 

<6 + 6 + 6 = 36 

if m > N, n > N, m g S, n g S. //// 

11.29 Theorem Suppose that 

(a) X is a separable Banach space, 

(b) { A„} is a sequence of linear functionals on X, 

(c) sup II A„ II = M < 00. 

n 

Then there is a subsequence {A„.} such that the limit 

Ax = lim A„. x (1) 

i-> oo 


exists for every x g X. Moreover, A is linear, and ||A|| < M. 

(In this situation, A is said to be the weak limit of {A„.}; see Exercise 18.) 
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Proof To say that X is separable means, by definition, that X has a count- 
able dense subset. The inequalities 

I A„x| < M\\x\\, I A„ x' - A n x"\ < M\\x' - x"|| 

show that {A„} is pointwise bounded and equicontinuous. Since each point 
of X is a compact set, Theorem 11.28 implies that there is a subsequence 
{A„.} such that {A w .x} converges, for every x e I, as i-> oo. To finish, define 
A by (1). It is then clear that A is linear and that || A|| < M. //// 

Let us recall, for the application that follows, that C(T) and If(T)(p < oo) are 
separable Banach spaces, because the trigonometric polynomials are dense in 
them, and because it is enough to confine ourselves to trigonometric polynomials 
whose coefficients lie in some prescribed countable dense subset of the complex 
field. 

11.30 Theorem Suppose u is harmonic in 17, 1 < p < oo, and 

sup II u r \\ p = M < oo. (1) 

0 <r< 1 

(a) If p = 1, it follows that there is a unique complex Borel measure p on T so 
that u = P\_dp ]. 

(b) If p > 1, it follows that there is a unique f e LF(T) so that u = P[/]. 

(c) Every positive harmonic function in U is the Poisson integral of a unique 
positive Borel measure on T. 


Proof Assume first that p = 1. Define linear functionals A r on C(T) by 


Kg = 


J gu r da 


(0 < r < 1). 


( 2 ) 


By (1), || A r || < M. By Theorems 11.29 and 6.19 there is a measure p on T, 
with || p\\ < M, and a sequence r,— > 1, so that 

lim gu r . da = g dp (3) 

j-> oo Jr Jr 


for every g e C(T). 

Put hfz) = u(rj z). Then hj is harmonic in 17, continuous on £7, and is 
therefore the Poisson integral of its restriction to T (Theorem 11.9). Fix 
z e U 9 and apply (3) with 


g(e“) = P{z , e“). 


(4) 
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Since hjie*) = u rj {e lt ), we obtain 

u(z) = lim u(rjz) = lim h } {z) 
j j 


= lim 


j 


P(z, e lt )hj(e lt ) do(e lt ) 


L 


P(z, e“) dn(e u ) = P[dnJz). 


If 1 < p < oo, let q be the exponent conjugate to p. Then I3(T) is separa- 
ble. Define A r as in (2), but for all g e I3(T). Again, ||A r || < M. Refer to Theo- 
rems 6.16 and 11.29 to deduce, as above, that there is an /e L P (T), with 
ll/llp < M, so that (3) holds, with / dcr in place of dp, for every g e I3(T). The 
rest of the proof is as it was in the case p — 1. 

This establishes the existence assertions in (a) and ( b ). To prove unique- 
ness, it suffices to show that P\dp\ = 0 implies p = 0. 

Pick / e C(T), put u = P[/], v = P[dp']. By Fubini’s theorem, and the 
symmetry P(re ie , e il ) = P(re l \ e w ), 




(0 < r < 1). 


( 5 ) 


When v = 0 then v r — 0, and since u r ^>f uniformly, as 1, we conclude 
that 


J r /^ = 0 (6) 

for every / e C(T) if P\dp ] = 0. By Theorem 6.19, (6) implies that p — 0. 
Finally, (c) is a corollary of (a), since u > 0 implies (1) with p = 1 : 

| u r \d<T = w r = w(0) (0 < r < 1) (7) 

Jr Jr 

by the mean value property of harmonic functions. The functionals A r used 
in the proof of (a) are now positive, hence p> 0. //// 

11.31 Since holomorphic functions are harmonic, all of the preceding results (of 
which Theorems 11.16, 11.24, 11.25, 11.30 are the most significant) apply to holo- 
morphic functions in U. This leads to the study of the H p - spaces, a topic that will 
be taken up in Chap. 17. 

At present we shall only give one application, to functions in the space //°°. 
This, by definition, is the space of all bounded holomorphic functions in U ; the 
norm 


ll/ll oo = sup { |/(z) | : 2 e U} 
turns H°° into a Banach space. 
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As before, T°(T) is the space of all (equivalence classes of) essentially 
bounded functions on T, normed by the essential supremum norm, relative to 
Lebesgue measure. For g e \\g\\ ^ stands for the essential supremum of | g \ . 

11.32 Theorem To every /eH® corresponds a function /* e E°(T), defined 
almost everywhere by 

f*{e w ) = Km f{re m ). (1) 

r~* 1 

The equality 11/ 1|. = ||/*IL holds. 

Iff*( eld ) — 0 f or almost all e ld on some arc I cz T, then f(z) = 0 for every 
zeU. 

(A considerably stronger uniqueness theorem will be obtained later, in 
Theorem 15.19. See also Theorem 17.18 and Sec. 17.19.) 

Proof By Theorem 11.30, there is a unique g e L°°(T) such that / = P[_gf By 
Theorem 11.23, (1) holds with /* = g. The inequality WfW^ < ||/*|| Q0 follows 
from Theorem 11.16(1); the opposite inequality is obvious. 

In particular , iff* = 0 a.e ., then ||/*|| „ = 0, hence ||/|| „ = 0, hence f = 0. 
Now choose a positive integer n so that the length of / is larger than 
2n/n. Let a = exp {2ni/n} and define 

m = n /(«**) (* £ U). (2) 

k= 1 

Then FeH 00 and F* = 0 a.e. on T, hence F(z) = 0 for all z e U. If Z(/), the 
zero set of / in U , were at most countable, the same would be true of Z(F), 
since Z(F) is the union of n sets obtained from Z(/) by rotations. But 
Z(F) = U. Hence / = 0, by Theorem 10.18. //// 

Exercises 

1 Suppose u and v are real harmonic functions in a plane region Q. Under what conditions is uv 
harmonic? (Note that the answer depends strongly on the fact that the question is one about real 
functions.) Show that u 2 cannot be harmonic in Q, unless u is constant. For which / e H( Q) is |/| 2 
harmonic? 

2 Suppose / is a complex function in a region Q, and both / and f 2 are harmonic in Q. Prove that 
either /or /is holomorphic in Q. 

3 If u is a harmonic function in a region Q, what can you say about the set of points at which the 
gradient of u is 0? (This is the set on which u x = u y = 0.) 

4 Prove that every partial derivative of every harmonic function is harmonic. 

Verify, by direct computation, that P r {9 — t ) is, for each fixed t , a harmonic function of re w . 
Deduce (without referring to holomorphic functions) that the Poisson integral P[dfi\ of every finite 
Borel measure n on T is harmonic in U, by showing that every partial derivative of P[dfi] is equal to 
the integral of the corresponding partial derivative of the kernel. 

5 Suppose / e H{ Q) and / has no zero in Q. Prove that log | / 1 is harmonic in Q, by computing its 
Laplacian. Is there an easier way? 
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6 Suppose / e H(U), where U is the open unit disc, /is one-to-one in U, Q = f(U), and f{z) = ]T c n z n . 
Prove that the area of Q is 


n £«|c„l 2 - 

n= 1 

Hint : The Jacobian of/is | /' | 2 . 

7 (a) If/e H(Q),f(z ) # 0 for z e Q, and — oo < a < oo, prove that 

A(i/n = a 2 i/r- 2 i/'i 2 , 

by proving the formula 


m'i'°uf)) = (<p°\f\ 2 )-\r\\ 


in which i ft is twice differentiable on (0, oo) and 


(p(t) = 

(b) Assume / e H( Q) and $ is a complex function with domain / (Q), which has continuous 
second-order partial derivatives. Prove that 

A[a>o/] = [(A0))o/] • |/'| 2 . 


Show that this specializes to the result of ( a ) if <l>(w) = d>( | w |). 

8 Suppose Q is a region,/, e H{Q) for n = 1, 2, 3, . u n is the real part of /„, {«„} converges uni- 
formly on compact subsets of Q, and {/„(z)} converges for at least one zeQ. Prove that then {/„} 
converges uniformly on compact subsets of Q. 

9 Suppose u is a Lebesgue measurable function in a region Q, and u is locally in L 1 . This means that 
the integral of | u \ over any compact subset of Q is finite. Prove that u is harmonic if it satisfies the 
following form of the mean value property : 


u{a) 


i rr 

= — u{x, 

nr J. 


y ) dx dy 


whenever D{a\ r ) c: Q. 

10 Suppose / = [a, b~\ is an interval on the real axis, q> is a continuous function on /, and 


/(z) = 


2ni 


<P(t) 


dt 


(z ^ /). 


Show that 


lim lf{x + ie) -f{x - ie)] (e > 0) 
f -o 

exists for every real x, and find it in terms of q>. 

How is the result affected if we assume merely that (p e ill What happens then at points x at 
which (p has right- and left-hand limits? 

11 Suppose that / = [ a , h], Q is a region, / c: Q,/is continuous in Q, and / e H{ Q — I). Prove that 
actually / e H( Q). 

Replace / by some other sets for which the same conclusion can be drawn. 

12 (Harnack’s Inequalities) Suppose Q is a region, K is a compact subset of Q, z 0 e Q. Prove that 
there exist positive numbers a and / (depending on z 0 , K, and Q) such that 

a u(z 0 ) < u(z) < 0u(z o ) 

for every positive harmonic function u in Q and for all z e K. 
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If {u n } is a sequence of positive harmonic functions in Q and if u„{z 0 )— ► 0, describe the behavior 
of {«„} in the rest of Q. Do the same if u n {z 0 )— > oo. Show that the assumed positivity of {u n } is 
essential for these results. 

13 Suppose u is a positive harmonic function in U and m( 0) = 1. How large can u{j) be? How small? 
Get the best possible bounds. 

14 For which pairs of lines L x , L 2 do there exist real functions, harmonic in the whole plane, that are 
0 at all points of L x u L 2 without vanishing identically? 

15 Suppose u is a positive harmonic function in U, and u(re i0 ) — > 0 as r— ► 1, for every e ie # 1. Prove 
that there is a constant c such that 


u(re ie ) = cP r {0). 


16 Here is an example of a harmonic function in U which is not identically 0 but all of whose radial 
limits are 0: 


u{z) — Im 



Prove that this u is not the Poisson integral of any measure on T and that it is not the difference of 
two positive harmonic functions in U. 

17 Let O be the set of all positive harmonic functions u in U such that m( 0) = 1. Show that O is a 
convex set and find the extreme points of O. (A point x in a convex set O is called an extreme point of 
O if x lies on no segment both of whose end points lie in $ and are different from x.) Hint : If C is the 
convex set whose members are the positive Borel measures on T , of total variation 1, show that the 
extreme points of C are precisely those pe C whose supports consist of only one point of T. 

18 Let X* be the dual space of the Banach space X. A sequence {A„} in X* is said to converge weakly 
to A e X* if A„x— > Ax as n— > oo, for every x e X. Note that A„ — > A weakly whenever A„-+ A in the 
norm of X*. (See Exercise 8, Chap. 5.) The converse need not be true. For example, the functionals 
/— >f{n) on L 2 (T) tend to 0 weakly (by the Bessel inequality), but each of these functionals has norm 1. 

Prove that {||AJ|} must be bounded if {A„} converges weakly. 

19 (a) Show that 3P r {3) > 1 if 5 = 1 — r. 

(b) If p > 0, u = P[dp], and l d a T is the arc with center 1 and length 23, show that 


p(l d ) < 3u{l - 3) 

and that therefore 


(M^Xl)<7r(M radM Xl). 

(c) If, furthermore, p _L m, show that 

u{re i0 )— > oo a.e. [/i]. 

Hint : Use Theorem 7.15. 

20 Suppose E a T, m{E) = 0. Prove that there is an/ e H 00 , with /( 0) = 1, that has 

lim/(re*) = 0 

at every e 10 e E. 

Suggestion : Find a lower semicontinuous \j/ e I}{T), \j/ > 0, if/ = + oo at every point of E. There 
is a holomorphic g whose real part is P[iA]- Let /= 1/g. 

21 Define /e H(U) and g e H(U) by f(z ) = exp {(1 + z)/( 1 - z)}, g(z) = (1 - z) exp {-/(z)}. Prove 
that 


g*(e ie ) = lim g(re ie ) 

r~* 1 

exists at every e i0 e T, that g* e C(T), but that g is not in H 00 . 
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Suggestion : Fix s, put 


t + is — 1 
t + is + 1 


(0 < t < oo). 


For certain values of s, | g(z t ) | — ► oo as t — ► oo. 

22 Suppose u is harmonic in U, and {u r : 0 < r < 1} is a uniformly integrable subset of l}(T). (See 
Exercise 10, Chap. 6.) Modify the proof of Theorem 11.30 to show that u = P[/] for some / e l}(T). 

23 Put 6 n = 2~ n and define 


u(z) = £ «' 2 {P(Z, «"*) - P(z, 

B= 1 

for z e U. Show that u is the Poisson integral of a measure on T , that u(x) = 0 if — 1 < x < 1, but 
that 


u( 1 — € 4- I€) 

is unbounded, as € decreases to 0. (Thus u has a radial limit, but no nontangential limit, at 1.) 
Hint : If € = sin 6 is small and z = 1 — e + /€, then 

P(z, e*) - P(z, <?-*) > 1/e. 

24 Let D„(t) be the Dirichlet kernel, as in Sec. 5.11, define the Fejer kernel by 

^ n = KT < (Do + + ' * + D n ), 

N + 1 


put L N (t) - min ( N , n 2 /Nt 2 ). Prove that 


K 


N - 



1 — cos Nt 
1 - cos t 




and that J r < 2. 

Use this to prove that the arithmetic means 

S 0 + Si + • • • + S N 
Gn ~ N + 1 

of the partial sums s n of the Fourier series of a function / e I}(T ) converge to f(e ld ) at every Lebesgue 
point of /. (Show that sup | o N \ is dominated by Mf then proceed as in the proof of Theorem 1 1.23.) 

25 If 1 < p < co and / e /^(P 1 ), prove that (/ * h x )(x) is a harmonic function of x 4- in the upper 
half plane. (h x is defined in Sec. 9.7 ; it is the Poisson kernel for the half plane.) 



CHAPTER 

TWELVE 

THE MAXIMUM MODULUS PRINCIPLE 


Introduction 

12.1 The maximum modulus theorem (10.24) asserts that the constants are the 
only homomorphic functions in a region Q whose absolute values have a local 
maximum at any point of Q. 

Here is a restatement: If K is the closure of a bounded region Q, if f is contin- 
uous on K and holomorphic in Q, then 

\f(z)\ < H/H 0 Q (1) 

for every zeQ. If equality holds at one point zeQ, then f is constant . 

[The right side of (1) is the supremum of | / 1 on the boundary dQ of Q.] 

For if | /(z) | > || / 1| aQ at some zeQ, then the maximum of | / 1 on K (which is 
attained at some point of K , since K is compact) is actually attained at some 
point of Q, so /is constant, by Theorem 10.24. 

The equality 11/1100= ||/*|| 00 , which is part of Theorem 11.32, implies that 

1/001 <11 nice (zeUJeH^U)). (2) 

This says (roughly speaking) that |/(z)| is no larger than the supremum of the 
boundary values of / a statement similar to (1). But this time boundedness on U 
is enough; we do not need continuity on £7. 

This chapter contains further generalizations of the maximum modulus 
theorem, as well as some rather striking applications of it, and it concludes with a 
theorem which shows that the maximum property “almost” characterizes the 
class of holomorphic functions. 
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The Schwarz Lemma 

This is the name usually given to the following theorem. We use the notation 
established in Sec. 11.31. 

12.2 Theorem Suppose f e H 00 , ||/|| 0 0 < 1, and /( 0) = 0. Then 

\f(z)\<\z\ (zeU), (1) 

I /'(0) I < l ; (2) 

if equality holds in (1) for one z e U — {0}, or if equality holds in (2), then 
f(z) = Az, where X is a constant , \X \ = 1. 


In geometric language, the hypothesis is that / is a holomorphic mapping of 
U into U which keeps the origin fixed; part of the conclusion is that either / is a 
rotation or / moves each z e U — { 0} closer to the origin than it was. 


Proof Since /( 0) = 0, f(z)/z has a removable singularity at z = 0. Hence 
there exists g e H(U) such that /(z) = zg(z). If z e U and | z | < r < 1, then 


I g(z) | < max 
e 


I /(re”) I 

r 


< 


1 

r 


Letting r— ► 1, we see that \g(z)\ < 1 at every z e U. This gives (1). Since 
/'( 0) = 0(0), (2) follows. If | g(z) | = 1 for some z e U, then g is constant, by 
another application of the maximum modulus theorem. //// 


Many variants of the Schwarz lemma can be obtained with the aid of the 
following mappings of U onto U : 

12.3 Definition For any a el/, define 


<p«(z) = 


z — a 
1 — az 


12.4 Theorem Fix a e U. Then q) a is a one-to-one mapping which carries T 
onto T, U onto U, and a to 0. The inverse of (p a is (p- a .We have 

<Pa(0) = 1 — | a | 2 , cp' x (cc) = -f- J. (1) 

1 - |a| 


Proof <p a is holomorphic in the whole plane, except for a pole at 1/a which 
lies outside U. Straightforward substitution shows that 


<P-a:((Pa(z)) = *• 


(2) 
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Thus (p a is one-to-one, and <p_ a is its inverse. Since, for real t , 


e lt — a | e lt — a | 

1 — dce lt | e~ il — a | 


( 3 ) 


(z and z have the same absolute value), <p a maps T into T; the same is true of 
<p_ a ; hence (p a {T) = T. It now follows from the maximum modulus theorem 
that (p a (U) cz U, and consideration of <p_ a shows that actually (pJJJ) = U. //// 


12.5 An Extremal Problem Suppose a and /? are complex numbers, |a| < 1, and 
| P | < 1. How large can | /'(a) | be iff is subject to the conditions f e H 00 , || / 1| „ < 1, 
andf( a) = /J? 

To solve this, put 


Q = (pp°f° <P-*- (1) 

Since (p_ a and cp p map U onto U , we see that g e H™ and WgW^ < 1; also, 
g( 0) = 0. The passage from f to g has reduced our problem to the Schwarz 
lemma, which gives | g'(0) | < 1. By (1), the chain rule gives 


g\ 0 ) = cp' p m f (oc)cp'_M 

If we use Eqs. 12.4(1), we obtain the inequality 


l/'(a)|< 


i-i/? i 2 
i-M 2 ' 


(2) 

( 3 ) 


This solves our problem, since equality can occur in (3). This happens if and 
only if | g'(0) | = 1, in which case g is a rotation (Theorem 12.2), so that 

/(z) = <P-f{foPcfc)) ( z e V) ( 4 ) 

for some constant X with \X \ = 1. 

A remarkable feature of the solution should be stressed. We imposed no 
smoothness conditions (such as continuity on U, for instance) on the behavior of 
/ near the boundary of U. Nevertheless, it turns out that the functions / which 
maximize |/'(<*)| under the stated restrictions are actually rational functions. 
Note also that these extremal functions map U onto U (not just into) and that 
they are one-to-one. This observation may serve as the motivation for the proof 
of the Riemann mapping theorem in Chap. 14. 

At present, we shall merely show how this extremal problem can be used to 
characterize the one-to-one holomorphic mappings of U onto U. 


12.6 Theorem Suppose f e H(U ), / is one-to-one , f(U) =1/, a el/, and 
/(a) = 0. Then there is a constant X, \X \ = 1, such that 

f(z) = A<p a (z) (z e U). (1) 

In other words, we obtain /by composing the mapping with a rotation. 
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Proof Let g be the inverse of / defined by g(f(z)) = z, z e JJ. Since / is one- 
to-one, /' has no zero in U, so g e H(U), by Theorem 10.33. By the chain 
rule, 


mm = i. (2) 

The solution of 12.5, applied to / and to g, yields the inequalities 

I /'(<*) | < - [j > lff'(0)l<l-|a| 2 . (3) 

1 — I a I 

By (2), equality must hold in (3). As we observed in the preceding problem 
(with = 0), this forces / to satisfy (1). //// 

The Phragmen-Lindelof Method 

12.7 For a bounded region Q, we saw in Sec. 12.1 that if /is continuous on the 
closure of Q and if / e H(Q\ the maximum modulus theorem implies 

ll/lln = II /II an* (1) 

For unbounded regions, this is no longer true. 

To see an example, let 

_ f 71 tz) 

Q = jz = X + iy: - - < y < (2) 

Q is the open strip bounded by the parallel lines y = ±n/2; its boundary dQ. is 
the union of these two lines. Put 

f(z) = exp (exp (z)). (3) 

For real x, 

± = exp (±ie x ) (4) 

since exp (ni/2) = i, so | f(z) \ = 1 for z e dQ. But f(z)-* oo very rapidly as x— ► oo 
along the positive real axis, which lies in Q. 

“Very” is the key word in the preceding sentence. A method developed by 
Phragmen and Lindelof makes it possible to prove theorems of the following 
kind: If / e H(Q) and if |/| < g, where #(z)— ► oo “slowly” as z— ► oo in Q (just 
what “ slowly ” means depends on Q), then / is actually bounded in Q, and this 
usually implies further conclusions about / by the maximum modulus theorem. 

Rather than describe the method by a theorem which would cover a large 
number of situations, we shall show how it works in two cases. In both, Q will be 
a strip. In the first, / will be assumed to be bounded, and the theorem will 
improve the bound ; in the second, a growth condition will be imposed on / which 
just excludes the function (3). In view of later applications, Q will be a vertical 
strip in Theorem 12.8. 
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First, however, let us mention another example which also has this general 
flavor: Suppose f is an entire function and 

\f(z)\<l +\z\ 1 ' 2 (5) 

for all z. Then f is constant . 

This follows immediately from the Cauchy estimates 10.26, since they show 
that f {n \ 0) = 0 for n = 1, 2, 3, 

12.8 Theorem Suppose 

Q = {x + iy: a < x < b}, Cl = {x + iy: a < x < b} 9 (1) 

/ is continuous on Cl, f e H(Q), and suppose that \f(z)\ < B for all z e Q and 
some fixed B < oo .If 

M(x) = sup { I f(x + iy) | : — oo < y < oo} (a < x < b) (2) 

then we actually have 

M(x) b ~ a < M(a) b - x M(b) x ~ a (a < x < b). (3) 

Note: The conclusion (3) implies that the inequality \ f\<B can be replaced by 
\f\< max (M(a), M(b)), so that | / 1 is no larger in Q than the supremum of | / 1 
on the boundary of Q. 

If we apply the theorem to strips bounded by lines x = a and x = /?, where 
a < a < P < b, the conclusion can be stated in the following way: 

Corollary Under the hypotheses of the theorem , log M is a convex function on 
(a, b). 

Proof We assume first that M(a) = M(b) = 1. In this case we have to prove 
that | f(z) | < 1 for all zeQ. 

For each e > 0, we define an auxiliary function 

MZ) - 1 T i - a) (2 6 n >' (4) 

Since Re {1 + e(z — a)} = 1 + e(x — a) > 1 in Q, we have | h e \ < 1 in Q, so 
that 

\f(z)K(z)\<\ (zeday (5) 

Also, 1 1 + e(z — a) | > e | y \ , so that 

I f(z)h e (z) | < (z = x + iy e £j). (6) 

Let R be the rectangle cut off from Q by the lines y = ± B/e. By (5) and 
(6), \fh € \ < 1 on dR , hence \fh e \<l on R, by the maximum modulus 
theorem. But (6) shows that \fh e \ < 1 on the rest of Q. Thus \f(z)h e (z)\ < 1 
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for all z e Q and all e > 0. If we fix z e Q and then let e— ► 0, we obtain the 
desired result |/(z)| < 1. 

We now turn to the general case. Put 

g(z) = M(a) ib ~ z)lib ~ a) M(b) iz ~ a)lib ~ fl) , (7) 

where, for M > 0 and w complex, M w is defined by 

M w = exp (w log M), (8) 

and log M is real. Then g is entire, g has no zero, 1/g is bounded in Q, 

I g(a + iy) I = M(a), I g(b + iy) \ = M{b), (9) 

and hence f/g satisfies our previous assumptions. Thus | f/g \ < 1 in Q, and 
this gives (3). (See Exercise 7.) //// 

12.9 Theorem Suppose 

= jx + iy: \y \ < ||, H = jx + iy: |_y| < |1. (1) 

Suppose f is continuous onQfe ET(Q), there are constants a < 1, A < oo, such 
that 

|/(z)| < exp {A exp (a|x|)} (z = x + iy e Q), (2) 

and 

±^J <1 (- oc < x < oo ). (3) 

Then \f(z) \ < 1 for all zeQ. 

Note that the conclusion does not follow if a = 1, as is shown by the function 
exp (exp z). 

Proof Choose f > 0 so that a</?<l.Fore>0, define 

h € (z) = exp { -e{e Pz + e~ fiz )}. (4) 

For zeQ, 

Re [<** + e~ fiz ] = (e fix + e~ fix ) cos fy > S(e^ x + e~^ x ) (5) 

where S = cos (fin/2) > 0, since | f \ < 1. Hence 

| h e (z) | < exp { -€S(e fix + e~^ x )} <1 (ze Q). (6) 


It follows that | fh e \ < 1 on dQ and that 

lf(z)h £ (z)\ < exp {Ae* lxl - eS(e Px + e~ Px )} (z e. Q). 


( 7 ) 
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Fix 6 > 0. Since ed > 0 and /? > a, the exponent in (7) tends to — oo as 
x — ► + oo. Hence there exists an x 0 so that the right side of (7) is less than 1 
for all x > x 0 . Since | fh e \ < 1 on the boundary of the rectangle whose ver- 
tices are ± x 0 ± (ni/2), the maximum modulus theorem shows that actually 
| fh e | < 1 on this rectangle. Thus \fh € \<l at every point of Q, for every 
6 > 0. As 6 — ► 0, h € (z)-* 1 for every z, so we conclude that |/(z)| < 1 for all 
Z E Q. III! 

Here is a slightly different application of the same method. It will be used in 
the proof of Theorem 14.18. 

12.10 Lindelof ’s Theorem Suppose T is a curve , with parameter interval [0, 1], 
such that | T(r) | < 1 if t < 1 and T(l) = 1. If g e H 00 and 

lim g(T(t)) = L, (1) 

t~*i 


then g has radial limit L at 1. 

(It follows from Exercise 14, Chap. 14, that g actually has nontangential 
limit L at 1.) 

Proof Assume \ g\ < 1, L = 0, without loss of generality. Let e > 0 be given. 
There exists t 0 < 1 so that, setting r 0 = Re T(t 0 ), we have 

| g(T(t)) | < c and Re r(t) > r 0 > 1 (2) 

as soon as t 0 < t < 1. 

Pick r,r 0 <r< 1. 

Define h in Q = D( 0; 1) n D(2r; 1) by 

h(z) = g(z)g(z)g(2r - z)g(2r - z). (3) 

Then h e H( Q) and | h \ < 1. We claim that 

I Hr) | < 6. (4) 

Since h(r) = | g(r) | 4 , the theorem follows from (4). 

To prove (4), let E x = T([t l9 1]), where t x is the largest t for which 
Re T(r) = r, let E 2 be the reflection of E x in the real axis, and let E be the 
union of u £ 2 and its reflection in the line x = r. Then (2) and (3) imply 
that 


| h(z) | < 6 if z e Q n E. 


( 5 ) 


Pick c > 0, define 


h c (z) = h(z)( 1 - z) c (2r - 1 - z)‘ 


( 6 ) 
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for zeQ, and put h c (l) — h c (2r — 1) = 0. If K is the union of E and the 
bounded components of the complement of £, then K is compact, h c is con- 
tinuous on K , holomorphic in the interior of K , and (5) implies that \ h c \ < e 
on the boundary of K. Since the construction of E shows that r e K, the 
maximum modulus theorem implies that | h c (r) \ < e. 

Letting c—> 0, we obtain (4). //// 


An Interpolation Theorem 

12.11 The convexity theorem 12.8 can sometimes be used to prove that certain 
linear transformations are bounded with respect to certain L p -norms. Rather than 
discuss this in full generality, let us look at a particular situation of this kind. 

Let X be a measure space, with a positive measure p, and suppose ( \j / n } 
(n = 1, 2, 3, ...) is an orthonormal set of functions in l3(p); we recall what this 
means : 


| dfl = I 


if m = n, 
if m^n. 


(i) 


Let us also assume that {\j/ n } is a bounded sequence in E°(p): There exists an 
M < oo such that 


l<A„M|<M (n= l,2,3,...;xel). (2) 

Then for any / e L p (/x), where 1 < p < 2, the integrals 

/(n) = £#„ dfi («= 1,2,3,...) (3) 

exist and define a function / on the set of all positive integers. 

There are now two very easy theorems: For / e L 1 ^), (2) gives 

ll/ll oo < ^ ll/ll i, (4) 

and for / e L 2 (/x), the Bessel inequality gives 

Il/ll 2 < II/II 2 . (5) 

where the norms are defined as usual : 

ll/ll 4 = [Zl/(n)N 1/< ', (6) 

and || / 1| oo = sup„|/(n)|. 


Since (1, 00 ) and (2, 2) are pairs of conjugate exponents, one may conjecture 
that ||/|| 4 is finite whenever / e L p (/x) and 1 < p < 2, q — p/(p — 1). This is indeed 
true and can be proved by “interpolation” between the preceding trivial cases 
p — 1 and p — 2. 
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12.12 The Hausdorff- Young Theorem Under the above assumptions , the 
inequality 

ll/ll, <M (2 -^||/|| p (1) 


holds if 1 < p < 2 and iff e L?(p). 


Proof We first prove a reduced form of the theorem. 

Fix p, 1 < p < 2. Let / be a simple complex function such that \\f\\ p = 1, 
and let b l9 . .., b N be complex numbers such that £ \ b n \ p = 1. Our objective 
is the inequality 


I *>„/(«) 


n = 1 


< m ( 2 ~ p)Ip . 


(2) 


Put F = | / | p , and put B„ = \ b„ | p . Then there is a function q> and there 
are complex numbers f} u ..., f} N such that 

f=F llp (p, M = l, |f dpi = 1, (3) 

and 

b n — Bj, /p P n , |/J„| = 1, £ = 1- (4) 

n = 1 

If we use these relations and the definition of f(n) given in Sec. 12.11, we 
obtain 


£ bj(n) = £ B i j p p n ^ llP ^n 


(5) 


n= 1 


Now replace 1/p by z in (5), and define 


0 ( 2 ) = 



F z (p\j/ n dfi 


( 6 ) 


for any complex number z. Recall that A z = exp (z log A) if A > 0; if A = 0, 
we agree that A z = 0. Since F is simple, since F > 0, and since B n > 0, we see 
that O is a finite linear combination of such exponentials, so is an entire 
function which is bounded on 


{z: a < Re (z) < b} 

for any finite a and b. We shall take a — % and b = 1, shall estimate O on the 
edges of this strip, and shall then apply Theorem 12.8 to estimate 0(l/p). 

For — oo < y < oo, define 

°n(y) = JV 1/2 F>^ dll. 


(7) 
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The Bessel inequality gives 

Z K(y)l : 

n = 1 

and then the Schwarz inequality shows that 


< \\F l,2 F iy q>\ 2 dfi= \\F\dp = l, 
Jx Jx 


The estimate 


Z By 2 B‘%c n 

n= 1 


N N "} 1/2 

<<! Z I Ic„l 2 f <1. 

n=l n= 1 J 


( 8 ) 

(9) 


1 0(1 + iy) | < M ( — oo < y < oo) 

follows trivially from (3), (4), and (6), since < M. 

We now conclude from (9), (10), and Theorem 12.8 that 

1 0(x + iy) | < M lx ~ 1 (i<x<l, — oo < y < oo). 

With x = 1/p and y = 0, this gives the desired inequality (2). 

The proof is now easily completed. Note first that 


N ) 1/q 

Z l/(»)l’ = SU P 

n = l J 


Z K /(») 

In = 1 


(10) 


(11) 


( 12 ) 


the supremum being taken over all {b u ...,b N } with £ | b n \ p = 1, since the I? 
norm of any function on any measure space is equal to its norm as a linear 
functional on II. Hence (2) shows that 




Z I/WI7 < M i2 ~ p)/p \\f\\ l 


(13) 


for every simple complex/ e B{p). 

If now / e II(p\ there are simple functions f 3 such that \\fj —f\\ p ^> 0 as 
j-* oo. Then f(n) for every n, because ij/ n e I3(p). Thus since (13) holds 
for each f j9 it also holds for / Since N was arbitrary, we finally obtain (1). //// 


A Converse of the Maximum Modulus Theorem 

We now come to the theorem which was alluded to in the introduction of the 
present chapter. 

The letter j will denote the identity function: j(z) = z. 

The function which assigns the number 1 to each z e U will be denoted by 1. 

12.13 Theorem Suppose M is a vector space of continuous complex functions 
on the closed unit disc £7, with the following properties: 

(a) 1 e M. 

(b) Iff e M, then also jf e M. 

(c) Iff £ M 9 then \\f\\ v = \\f\\ T - 

Then every f e M is holomorphic in U. 



THE MAXIMUM MODULUS PRINCIPLE 263 


Note that (c) is a rather weak form of the maximum modulus principle; (c) 
asserts only that the overall maximum of | / 1 on U is attained at some point of 
the boundary T, but (c) does not a priori exclude the existence of local maxima of 
I/I in U. 

Proof By (a) and (b), M contains all polynomials. In conjunction with (c), 
this shows that M satisfies the hypotheses of Theorem 5.25. Thus every / e M 
is harmonic in U. We shall use (b) to show that every / e M actually satisfies 
the Cauchy-Riemann equation. 

Let d and d be the differential operators introduced in Sec. 11.1. The 
product rule for differentiation gives 

mM =/ • (ddg) + m ■ (dg) + (df) • 0 dg ) + (88 f) • g. 

Fix / e M, and take g = j. Then fj e M. Hence / and fj are harmonic, so 
ddf = 0 and (dd)(fj) = 0. Also, dj = 0 and dj = 1. The above identity therefore 
reduces to df = 0. Thus / e H(U). //// 

This result will be used in the following proof. 

12.14 Rado’s Theorem Assume f e C(U) 9 Q is the set of all z e U at which f(z) ^ 0, 
and f is holomorphic in Q. Then f is holomorphic in U. 

In particular, the theorem asserts that U — Q is at most countable, unless 

n = 0. 

Proof Assume Q # 0. We shall first prove that Q is dense in U . If not, there 
exist a e Q and /? e U — Q such that 2\fi — a | < 1 — |/?|. Choose n so that 
2" | /(a) | > ||/|| r . Define h(z) = (z - p)' n f(z), forzefi. Ifzel/n dQ, then 
f(z) — 0, hence h(z) = 0. If z e T n d£2, then 

|^)I<(1 -|^l)" n ll/llr<|a-^r ,, l/WI = IMa)|. 

This contradicts the maximum modulus theorem. 

Thus Q is dense in U. 

Next, let M be the vector space of all g e C(U) that are holomorphic in 
Q. Fix g e M. For n = 1, 2, 3, ..., fg n = 0 on U n dQ. The maximum 
modulus theorem implies therefore, for every aeQ, that 

\m\ l0(a)|" < \\fg n u= \\fg n \\T< \\f\\T\\g\\ n T- 

If we take nth roots and then let n-> oo, we see that \g((x)\ < \\g\\ T , for every 
aeQ. Since Q is dense in U 9 \\g\\ v = \\g\\ T - 

It follows that M satisfies the hypotheses of Theorem 12.13. Since f e M, 
/is holomorphic in U. Ill / 
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Exercises 


1 Suppose A is a closed equilateral triangle in the plane, with vertices a, b, c. Find 
max ( | z — a\ \z — b\ \ z — c | ) as z ranges over A. 

2 Suppose / e H(II + ), where II + is the upper half plane, and |/| < 1. How large can |/'(i)| be? Find 
the extremal functions. (Compare the discussion in Sec. 12.5.) 

3 Suppose / e H(Q). Under what conditions can | / 1 have a local minimum in Q? 

4 ( a ) Suppose Q is a region, D is a disc, D a Q,/e H(Q),/is not constant, and |/| is constant on the 
boundary of D. Prove that / has at least one zero in D. 

(b) Find all entire functions / such that |/(z) | = 1 whenever | z | = 1. 

5 Suppose Q is a bounded region, {/„} is a sequence of continuous functions on Q which are holo- 
morphic in Q, and {/„} converges uniformly on the boundary of Q. Prove that {/„} converges uni- 
formly on Q. 

6 Suppose / e H(Q), T is a cycle in Q such that Ind r (a) = 0 for all a £ Q, | /(£) | < 1 for every £ e T*, 
and Ind r (z) # 0. Prove that | /(z) | < 1. 

7 In the proof of Theorem 12.8 it was tacitly assumed that M(a) > 0 and M{b) > 0. Show that the 
theorem is true if M(a) = 0, and that then /(z) = 0 for all z e Q. 

8 If 0 < R 1 < R 2 < o°, let A(R t , R 2 ) denote the annulus 

{z^dzIcR,}. 

There is a vertical strip which the exponential function maps onto A(R t , R 2 ). Use this to prove 
Hadamard’s three-circle theorem: If f e H(A(R 1 , R 2 )), if 

M(r) = max | /(re'*) | (R t < r < R 2 ), 
e 

and if R 1 < a < r < b < R 2 , then 


log M(r) < 


log (b/r) 

log (b/a) 


log M(a) + 


log (r/a) 
log (b/a) 


log M(b). 


[In other words, log M(r) is a convex function of log r.] For which / does equality hold in this 
inequality? 

9 Let II be the open right half plane (z e II if and only if Re z > 0). Suppose / is continuous on the 
closure of II (Re z > 0),/e H(II), and there are constants A < oo and a < 1 such that 


l/(z)| < A exp ( | z I*) 


for all z e II. Furthermore, | f(iy) | < 1 for all real y. Prove that | /(z) | < 1 in II. 

Show that the conclusion is false for a = 1. 

How does the result have to be modified if n is replaced by a region bounded by two rays 
through the origin, at an angle not equal to 7r? 

10 Let n be the open right half plane. Suppose that / e H( n), that | /(z) | < 1 for all z e n, and that 
there exists a, —n/2 < a < 7r/2, such that 


log | / (re i<x ) | 


Prove that / = 0. 

Hint: Put g n {z) = f{z)e nz , n = 1, 2, 3, .... Apply Exercise 9 to the two angular regions defined by 
— n/2 <0<a, ol < 0 < n/2. Conclude that each g n is bounded in n, and hence that | g n \ < 1 in n, for 
all n. 

11 Suppose T is the boundary of an unbounded region Q,/e H(Q),/ is continuous on Q u T, and 
there are constants B < oo and M < oo such that | / 1 < M on T and | / 1 < B in Q. Prove that we 
then actually have | / 1 < M in Q. 
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Suggestion : Show that it involves no loss of generality to assume that U n C2 = 0. Fix z 0 e £2, 
let n be a large integer, let V be a large disc with center at 0, and apply the maximum modulus 
theorem to the function /"(z)/z in the component of V n £2 which contains z 0 . 

12 Let /be an entire function. If there is a continuous mapping y of [0, 1) into the complex plane such 
that y(t) — ► oo and f(y{t)) — ► a as t— > 1, we say that a is an asymptotic value of /. [In the complex plane, 
“y(t)— > oo as t— ► 1” means that to each R < oo there corresponds a t R < 1 such that |y(r)| > R if 
t R < t < 1.] Prove that every nonconstant entire function has oo as an asymptotic value. 

Suggestion : Let E n = {z: |/(z)| > n}. Each component of E n is unbounded (proof?) and contains 
a component of E n + 15 by Exercise 1 1. 

13 Show that exp has exactly two asymptotic values: 0 and oo. How about sin and cos? Note: sin z 
and cos z are defined, for all complex z, by 

e iz -e~ iz e iz + e~ iz 

sin z = , cos z = . 

2 i 2 

14 If / is entire and if a is not in the range of / prove that a is an asymptotic value of / unless / is 
constant. 

15 Suppose / e H(U). Prove that there is a sequence {z„} in U such that |z„|— ► 1 and {/(z„)} is 
bounded. 

16 Suppose Q is a bounded region,/ e H(£2), and 

lim sup |/(z„)| < M 

n~* oo 

for every sequence {z„} in Q which converges to a boundary point of £2. Prove that \f{z)\<M for all 

z e Q. 

17 Let <S> be the set of all / e H(U) such that 0 < | /(z) | < 1 for z 6 17, and let <I> C be the set of all/e <S> 
that have /( 0) = c. Define 

M(c) = sup { | /'(0) | :/e <D C }, M = sup { |/'(0)|:/e <D}. 

Find M, and M(c ) for 0 < c < 1. Find an/ e $ with /'(0) = M, or prove that there is no such /. 

Suggestion : log / maps U into the left half plane. Compose log / with a properly chosen map that 
takes this half plane to U. Apply the Schwarz lemma. 
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APPROXIMATIONS BY RATIONAL 
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Preparation 

13.1 The Riemann Sphere It is often convenient in the study of holomorphic 
functions to compactify the complex plane by the adjunction of a new point 
called oo. The resulting set S 2 (the Riemann sphere , the union of R 2 and {oo}) is 
topologized in the following manner. For any r > 0, let D'( oo; r) be the set of all 
complex numbers z such that \z\>r, put D(oo; r) = D'(oo; r) u {oo}, and 
declare a subset of S 2 to be open if and only if it is the union of discs D(a ; r), 
where the a’s are arbitrary points of S 2 and the r’s are arbitrary positive numbers. 
On S 2 — {oo}, this gives of course the ordinary topology of the plane. It is easy to 
see that S 2 is homeomorphic to a sphere (hence the notation). In fact, a homeo- 
morphism cp of S 2 onto the unit sphere in R 3 can be explicitly exhibited: Put 
cp{ oo ) = (0, 0, 1), and put 


cp(re ie ) = 


/ 2 r cos 9 2 r sin 9 r 2 — 1\ 
\ r 2 + 1 ’ r 2 + 1 ’ r 2 + 1/ 


( 1 ) 


for all complex numbers re w . We leave it to the reader to construct the geometric 
picture that goes with (1). 

If /is holomorphic in D'(oo ; r), we say that / has an isolated singularity at oo. 
The nature of this singularity is the same as that which the function/ defined in 
D'(0; l/r) by/(z) =/(l/z), has at 0. 

Thus if /is bounded in D'{ oo; r), then 1 im z ^ a0 f(z) exists and is a complex 
number (as we see if we apply Theorem 10.20 to /), we define /(oo) to be this 
limit, and we thus obtain a function in D(oo; r) which we call holomorphic: note 
that this is defined in terms of the behavior of / near 0, and not in terms of 
differentiability of/ at oo. 


266 
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If / has a pole of order m at 0, then / is said to have a pole of order mat oo; 
the principal part of / at oo is then an ordinary polynomial of degree m (compare 
Theorem 10.21), and if we subtract this polynomial from f we obtain a function 
with a removable singularity at oo. 

Finally, if / has an essential singularity at 0, then /is said to have an essential 
singularity at oo. For instance, every entire function which is not a polynomial 
has an essential singularity at oo. 

Later in this chapter we shall encounter the condition “ S 2 — Q is connected,” 
where Q is an open set in the plane. Note that this is not equivalent to the 
condition “the complement of Q relative to the plane is connected.” For example, 
if Q consists of all complex z = x + iy with 0 < y < 1, the complement of Q rela- 
tive to the plane has two components, but S 2 — Q is connected. 

13.2 Rational Functions A rational function / is, by definition, a quotient of two 
polynomials P and Q :f= P/Q. It follows from Theorem 10.25 that every noncon- 
stant polynomial is a product of factors of degree 1. We may assume that P and 
Q have no such factors in common. Then / has a pole at each zero of Q (the pole 
of / has the same order as the zero of Q). If we subtract the corresponding prin- 
cipal parts, we obtain a rational function whose only singularity is at oo and 
which is therefore a polynomial. 

Every rational function / = P/Q has thus a representation of the form 

f(z) = A 0 (z) + (1) 

7=1 

where A 0 , A l9 . . . , A k are polynomials, A lf . . . , A k have no constant term, and a l9 
..., a k are the distinct zeros of Q\ (1) is called the partial fractions decomposition 
Off. 

We turn to some topological considerations. We know that every open set in 
the plane is a countable union of compact sets (closed discs, for instance). 
However, it will be convenient to have some additional properties satisfied by 
these compact sets : 

13.3 Theorem Every open set Q in the plane is the union of a sequence { K n }, 

n = 1, 2, 3, . . . , of compact sets such that 

(a) K n lies in the interior of K n + u for n = 1, 2, 3, — 

(b) Every compact subset of Q lies in some K n . 

(c) Every component of S 2 — K n contains a component of S 2 — Q,/or n = 1, 2, 
3,.... 

Property (c) is, roughly speaking, that K n has no holes except those which are 
forced upon it by the holes in Q. Note that Q is not assumed to be connected. 
The interior of a set E is, by definition, the largest open subset of E. 
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Proof For n = 1, 2, 3, , put 

K = D(cc ; n) u [J d( a; -) (1) 

and put K n = S 2 — V„. [Of course, a ^ oo in (1).] Then K n is a closed and 
bounded (hence compact) subset of Q, and Q = (J K n . If z e K n and 
r = n~ l — (n + l) -1 , one verifies easily that D(z ; r) K n + l . This gives (a). 
Hence Q is the union of the interiors W n of K n . If K is a compact subset of Q, 
then u • • • u W N for some AT, hence K a K N . 

Finally, each of the discs in (1) intersects S 2 — Cl; each disc is connected; 
hence each component of V n intersects S 2 — Q; since V n ~ ? S 2 — Q, no com- 
ponent of S 2 — Cl can intersect two components of V n . This gives (c). //// 

13.4 Sets of Oriented Intervals Let O be a finite collection of oriented intervals in 
the plane. For each point p, let be the number of members of O that 

have initial point [end point] p. If m 7 (p) = m E (p) for every p, we shall say that is 
balanced. 

If <S> is balanced (and nonempty), the following construction can be carried 

out. 

Pick = [a 0 , «i] e <t>. Assume k > 1, and assume that distinct members y u 

. .., y k of have been chosen in such a way that y t = la^ l9 aj for 1 < i < k. If 

a k = a 0 , stop. If a k ^ a 0 , and if precisely r of the intervals y l9 ..., y k have a k as 
end point, then only r — 1 of them have a k as initial point; since Q> is balanced, 
contains at least one other interval, say y k+1 , whose initial point is a k . Since <5> is 
finite, we must return to a 0 eventually, say at the nth step. 

Then y l9 . . . , y n join (in this order) to form a closed path. 

The remaining members of <S> still form a balanced collection to which the 
above construction can be applied. It follows that the members of <t> can be so 
numbered that they form finitely many closed paths. The sum of these paths is a 
cycle. The following conclusion is thus reached. 

If y N } is a balanced collection of oriented intervals , and if 

T = + ••• + y N 

then r is a cycle. 


13.5 Theorem If K is a compact subset of a plane open set Ci(^0\ then there 
is a cycle T in Cl — K such that the Cauchy formula 


/(*) = 


_L_ 

2ni 


' no 

r C ~ z 


dC 


( 1 ) 


holds for every f e H(C1) and for every z e K. 


Proof Since K is compact and Q is open, there exists an rj > 0 such that the 
distance from any point of K to any point outside Q is at least 2 rj. Construct 
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a grid of horizontal and vertical lines in the plane, such that the distance 
between any two adjacent horizontal lines is rj, and likewise for the vertical 
lines. Let Q u . .., Q m be those squares (closed 2-cells) of edge y\ which are 
formed by this grid and which intersect K. Then Q r c= Q for r = 1, . . . , m. 

If a r is the center of Q r and a r + b is one of its vertices, let y rk be the 
oriented interval 


y r k = L a r + i k b, a r + i k+1 b] 


( 2 ) 


and define 


dQr = 7rl + 7r2 + ?r3 + 7r4 (r = 1, . . ffl). (3) 


It is then easy to check (for example, as a special case of Theorem 10.37, or 
by means of Theorems 10.11 and 10.40) that 


Ind 8Cr (a) = 


if a is in the interior of Q r , 
if a is not in Q r . 


( 4 ) 


Let Z be the collection of all y rk (1 < r < m, 1 < k < 4). It is clear that Z 
is balanced. Remove those members of Z whose opposites (see Sec. 10.8) also 
belong to Z. Let Q> be the collection of the remaining members of Z. Then Q> 
is balanced. Let T be the cycle constructed from <I>, as in Sec. 13.4. 

If an edge E of some Q r intersects K , then the two squares in whose 
boundaries E lies intersect K. Hence Z contains two oriented intervals which 
are each other’s opposites and whose range is E. These intervals do not occur 
in Q>. Thus T is a cycle in Q — K. 

The construction of Q> from Z shows also that 


m 

Ind r (a) = X Ind 8G, («) 


r = 1 


( 5 ) 


if a is not in the boundary of any Q r . Hence (4) implies 


Ind r (a) = 


if a is in the interior of some Q r , 
if a lies in no Q r . 


( 6 ) 


If z e K, then z 4 r*, and z is a limit point of the interior of some Q r . 
Since the left side of (6) is constant in each component of the complement of 
T*, (6) gives 


Ind r (z) = 


{o 


if z g K, 
if z £ Q. 


( 7 ) 


Now (1) follows from Cauchy’s theorem 10.35. 


mi 
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Runge’s Theorem 

The main objective of this section is Theorem 13.9. We begin with a slightly 
different version in which the emphasis is on uniform approximation on one 
compact set. 

13.6 Theorem Suppose K is a compact set in the plane and {a,} is a set which 
contains one point in each component of S 2 — K. If Cl is open , Q K,f e H(Q), 
and 6 > 0, there exists a rational function R , all of whose poles lie in the pre- 
scribed set { (Xj }, such that 


\f(z)-R(z)\<e 


(1) 


for every z e K. 


Note that S 2 — K has at most countably many components. Note also that 
the preassigned point in the unbounded component of S 2 — K may very well be 
oo ; in fact, this happens to be the most interesting choice. 

Proof We consider the Banach space C(K) whose members are the contin- 
uous complex functions on K , with the supremum norm. Let M be the sub- 
space of C(K) which consists of the restrictions to K of those rational 
functions which have all their poles in {a,-}. The theorem asserts that /is in 
the closure of M. By Theorem 5.19 (a consequence of the Hahn-Banach 
theorem), this is equivalent to saying that every bounded linear functional on 
C(K) which vanishes on M also vanishes at f and hence the Riesz represent- 
ation theorem (Theorem 6.19) shows that we must prove the following asser- 
tion: 

If ju is a complex Bor el measure on K such that 

[* 4.-0 ( 2 ) 

for every rational function R with poles only in the set {a,}, and if f e H(Q ), 
then we also have 


(% 

JK 


fdfi = 0 . 


So let us assume that n satisfies (2). Define 

M0 


h(z) 


/» 


C - Z 


(zeS 2 - K). 


( 3 ) 


(4) 


By Theorem 10.7 (with X = K, cp(Q = (), h e H(S 2 — K). 
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Let Vj be the component of S 2 — K which contains ocj , and suppose 
D(<Xji r) a Vj. If a 7 ^ oo and if z is fixed in D(a 7 ; r), then 


1 


C — Z 


lim 

N-* oo 


f (z - «,)" 
,-0 (C - «/ + 1 


(5) 


uniformly for (e K. Each of the functions on the right of (5) is one to which 
(2) applies. Hence h(z) = 0 for all z e D(a y ; r). This implies that h(z) = 0 for all 
z e Vj, by the uniqueness theorem 10.18. 

If olj= oo, (5) is replaced by 

-!—= -lim £ z-"- 1 ? Re K,\z\>r\ (6) 

S ~ Z N-* oo >1 = 0 


which implies again that /i(z) = 0 in D( oo; r), hence in 1^.. We have thus 
proved from (2) that 


h(z) = 0 (zeS 2 - K). 


(7) 


Now choose a cycle T in Q — X, as in Theorem 13.5, and integrate this 
Cauchy integral representation of / with respect to An application of 
Fubini’s theorem (legitimate, since we are dealing with Borel measures and 
continuous functions on compact spaces), combined with (7), gives 



Hal 

55 j/ w d ~ 


m 


dw 


w-C 

f Ml 

JkW-C 


27ri 


I 


/(w)/i(w) dw = 0. 


The last equality depends on the fact that T* Q — K, where h(w) = 0. 

Thus (3) holds, and the proof is complete. //// 


The following special case is of particular interest. 

13.7 Theorem Suppose K is a compact set in the plane , S 2 — K is connected , 
and f e H(Q), where Q is some open set containing K. Then there is a sequence 
{P n } of polynomials such that P„(z)— >/(z) uniformly on K. 

Proof Since S 2 — K has now only one component, we need only one point 
ocj to apply Theorem 13.6, and we may take a 7 - = oo. //// 

13.8 Remark The preceding result is false for every compact K in the 
plane such that S 2 — K is not connected. For in that case S 2 — K has a 
bounded component V. Choose a e V, put /(z) = (z — a) -1 , and put 
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m = max { |z — a| : ze K}. Suppose P is a polynomial, such that 
| P(z) — f (z) | < 1/m for all z e X. Then 

| (z — a)P(z) — 1 1 < 1 (zeX). (1) 

In particular, (1) holds if z is in the boundary of V; since the closure of V is 
compact, the maximum modulus theorem shows that (1) holds for every 
zeK; taking z = a, we obtain 1 < 1. Hence the uniform approximation is 
not possible. 

The same argument shows that none of the a 7 can be dispensed with in 
Theorem 13.6. 

We now apply the preceding approximation theorems to approximation 
in open sets. Let us emphasize that X was not assumed to be connected in 
Theorems 13.6 and 13.7 and that Q will not be assumed to be connected 
in the theorem which follows. 


13.9 Theorem Let Q be an open set in the plane , let A be a set which has one 
point in each component of S 2 — Q, and assume f e H(Q). Then there is a 
sequence {R n } of rational functions , with poles only in A , such that ►/ uni- 
formly on compact subsets of Q. 

In the special case in which S 2 — Q is connected , we may take A — { oo} 
and thus obtain polynomials P n such that P n ^f uniformly on compact subsets 
of Cl. 


Observe that S 2 — Q may have uncountably many components; for instance, 
we may have S 2 — Q = {oo} u C, where C is a Cantor set. 

Proof Choose a sequence of compact sets K n in Q, with the properties speci- 
fied in Theorem 13.3. Fix n, for the moment. Since each component of 
S 2 — K n contains a component of S 2 — Q, each component of S 2 — K n con- 
tains a point of A , so Theorem 13.6 gives us a rational function R n with poles 
in A such that 


\RJtz)-f(z)\<- (zeK n ). (1) 

n 

If now K is any compact set in Q, there exists an N such that K a K n for 
all n > N. It follows from (1) that 


|fl B (z)-/(z)|<i (zeK,n>N), 


(2) 


which completes the proof. 


!h V 
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The Mittag-Leffler Theorem 

Kunge’s theorem will now be used to prove that meromorphic functions can be 
constructed with arbitrarily preassigned poles. 

13.10 Theorem Suppose Q is an open set in the plane , A a Q, A has no limit 
point in Q, and to each a e A there are associated a positive integer m(a) and a 
rational function 


m(a) 

P*(z)= £ Cj, a(z-a) - -'- 

j= 1 

Then there exists a meromorphic function f in Q, whose principal part at each 
a e A is P a and which has no other poles in Q. 

Proof We choose a sequence {K n } of compact sets in Q, as in Theorem 13.3: 
For n = 1, 2, 3, . . . , K n lies in the interior of K n+ 1? every compact subset of Q 
lies in some K n , and every component of S 2 — K n contains a component of 

S 2 — Q. Put A l = A n K l9 and A n = A n ( K n — K n _ l ) for n — 2, 3, 4, 

Since A n cl K n and A has no limit point in Q (hence none in K n ), each A n is a 
finite set. Put 


Qn(z) = I PM (»« 1,2,3,...). (1) 

ae A n 

Since each A n is finite, each Q n is a rational function. The poles of Q n lie in 
K n — K n _ u for n > 2. In particular, Q n is holomorphic in an open set con- 
taining It now follows from Theorem 13.6 that there exist rational 

functions R n , all of whose poles are in S 2 — Q, such that 

\R n (z)-Q n (z)\<2~ n (zeK^J. (2) 

We claim that 

f(z) = Gi(z) + t (e»(2) - (Z 6 Q) (3) 

n — 2 

has the desired properties. 

Fix N. On K n , we have 

/= Ql + I (Qn ~Rn)+ 1 (Qn ~ K)- (4) 

n= 2 N + 1 

By (2), each term in the last sum in (4) is less than 2~ n on K N ; hence this last 
series converges uniformly on K N , to a function which is holomorphic in the 
interior of K N . Since the poles of each R n are outside Q, 


/-(ei + -” + e*) 
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is holomorphic in the interior of K N . Thus / has precisely the prescribed 
principal parts in the interior of K N , and hence in Q, since N was arbitrary. 

Ill / 


Simply Connected Regions 

We shall now summarize some properties of simply connected regions (see 
Sec. 10.38) which illustrate the important role that they play in the theory of 
holomorphic functions. Of these properties, (a) and ( b ) are what one might call 
internal topological properties of Q; (c) and ( d ) refer to the way in which Q is 
embedded in S' 2 ; properties ( e ) to (h) are analytic in character; (j) is an algebraic 
statement about the ring H(Q). The Riemann mapping theorem 14.8 is another 
very important property of simply connected regions. In fact, we shall use it to 
prove the implication (j) — > (a). 

13.11 Theorem For a plane region Q, each of the following nine conditions 
implies all the others. 

(a) Q is homeomorphic to the open unit disc U. 

(b) Q is simply connected. 

(c) Ind y (a) = 0 for every closed path y in Q and for every a e S 2 — Q. 

(d) S 2 — Q is connected. 

(e) Every f e H(Q) can be approximated by polynomials , uniformly on compact 
subsets of Q. 

(/) For every f e H(Q) and every closed path y in Q, 

jV(z) dz = 0. 

(g) T o every f e H(Q) corresponds an F e H(Q) such that F' = /. 

(h) Iff e H(Q) and 1// e H(Q), there exists age H(Q) such that f = exp (g). 

(j) Iff e H(Q) and 1// e H(Q), there exists a(pe H(Q) such that f = cp 2 . 

The assertion of (h) is that / has a “holomorphic logarithm” g in Q; (j) 
asserts that / has a “holomorphic square root” cp in Q; and (/) says that the 
Cauchy theorem holds for every closed path in a simply connected region. 

In Chapter 16 we shall see that the monodromy theorem describes yet 
another characteristic property of simply connected regions. 

Proof (a) implies ( b ). To say that Q is homeomorphic to U means that there 
is a continuous one-to-one mapping ^ of Q onto U whose inverse if/~ l is also 
continuous. If y is a closed curve in Q, with parameter interval [0, 1], put 

H(s, t) = <p~ l (til/(y(s))). 

Then H: I 2 —>Q is continuous; H(s, 0) = i/'~ 1 (0), a constant; H(s, 1) = y(s); 
and H( 0, t) = H( 1, t) because y(0) = y(l). Thus Q is simply connected. 
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( b ) implies (c). If ( b ) holds and y is a closed path in Q, then y is (by 
definition of “ simply connected ”) Q-homotopic to a constant path. Hence (c) 
holds, by Theorem 10.40. 

(c) implies (d). Assume ( d ) is false. Then S 2 — Q is a closed subset of S 2 
which is not connected. As noted in Sec. 10.1, it follows that S 2 — Q is the 
union of two nonempty disjoint closed sets H and K. Let H be the one that 
contains oo. Let W be the complement of H, relative to the plane. Then 
W = Q u K. Since K is compact, Theorem 13.5 (with /= 1) shows that there 
is a cycle T in W — K = Q such that Ind r (z) = 1 for z e K. Since K j=- 0, (c) 
fails. 

(d) implies (e). This is part of Theorem 13.9. 

(e) implies (/). Choose / e H(Q), let y be a closed path in Q, and choose 
polynomials P n which converge to / uniformly on y*. Since P n (z) dz = 0 
for all n, we conclude that (/) holds. 

(/) implies ( g ). Assume (/) holds, fix z 0 e Q, and put 

F(z) = f dC (z e Q) (1) 

Jr (z) 

where T(z) is any path in Q from z 0 to z. This defines a function F in Q. For if 
r t (z) is another path from z 0 to z (in Q), then T followed by the opposite of 
r A is a closed path in Q, the integral of / over this closed path is 0, so (1) is 
not affected if T(z) is replaced by T^z). We now verify that F' = /. Fix a e Q. 
There exists an r > 0 such that D(a ; r) c= Q. For z e D(a ; r) we can compute 
F(z) by integrating/ over a path T(a), followed by the interval [a, z]. Hence, 
for z e D'(a ; r), 


m - m 

z — a 



r , 

J[a, z] 


/(C) d(, 


( 2 ) 


and the continuity of / at a implies now that F(a) = / (a), as in the proof of 
Theorem 10.14. 

(i g ) implies (h). Iff e H(Q) and /has no zero in Q, then /'// e H(Q), and ( g ) 
implies that there exists age H(Q) so that g' = /'// We can add a constant 
to 0 , so that exp {g(z 0 )} = f(z 0 ) for some z 0 e Q. Our choice of g shows that 
the derivative of fe~ 9 is 0 in Q, hence fe ~ 9 is constant (since Q is connected), 
and it follows that / = e 9 . 

(h) implies (/')• By (h),f= e 9 . Put (p = exp Qg). 

( j ) implies (a). If Q is the whole plane, then Q is homeomorphic to U : 
map z to z/(l H- |z|). 

If Q is a proper subregion of the plane which satisfies (/), then there 
actually exists a holomorphic homeomorphism of Q onto U (a conformal 
mapping). This assertion is the Riemann mapping theorem, which is the main 
objective of the next chapter. Hence the proof of Theorem 13.11 will be com- 
plete as soon as the Riemann mapping theorem is proved. (See the note 
following the statement of Theorem 14.8.) //// 
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The fact that ( h ) holds in every simply connected region has the following 
consequence (which can also be proved by quite elementary means): 

13.12 Theorem Iff e H(Q), where Q is any open set in the plane , and if f has no 
zero in Q, then log | / 1 is harmonic in Q. 

Proof To every disc DcQ there corresponds a function g e H(D) such that 
/ = e g in D. If u = Re g 9 then u is harmonic in D, and | / 1 = e u . Thus log | / 1 is 
harmonic in every disc in Q, and this gives the desired conclusion. //// 


Exercises 


1 Prove that every meromorphic function on S 2 is rational. 

2 Let O = {z: | z | < 1 and 1 2z — 1 1 > 1}, and suppose / e H(Cl). 

(< a ) Must there exist a sequence of polynomials P n such that P n — >f uniformly on compact 
subsets of O? 

(b) Must there exist such a sequence which converges to / uniformly in O? 

(c) Is the answer to ( b ) changed if we require more of /, namely, that / be holomorphic in some 
open set which contains the closure of ft? 

3 Is there a sequence of polynomials P n such that P„(0) = 1 for n = 1, 2, 3, . . . , but P„(z) — > 0 for every 
z # 0, as n — > oo ? 

4 Is there a sequence of polynomials P„ such that 


lim P„(z) = 


1 

0 

-1 


if Im z > 0, 
if z is real, 

if Im z < 0? 


5 For n = 1, 2, 3, ..., let A„ be a closed disc in U, and let L n be an arc (a homeomorphic image of 
[0, 1]) in U — A„ which intersects every radius of U. There are polynomials P„ which are very small 
on A„ and more or less arbitrary on L n . Show that {A„}, {L„}, and {P„} can be so chosen that the 
series /=IP„ defines a function / e H(U) which has no radial limit at any point of T. In other words, 
for no real 0 does lim r ^ t f(re ie ) exist. 

6 Here is another construction of such a function. Let {n k } be a sequence of integers such that n t > 1 
and n k+l > 2 kn k . Define 


h(z) = I 5‘z”*. 

k= 1 

Prove that the series converges if | z | < 1 and prove that there is a constant c > 0 such that 
| h(z ) | > c • 5 m for all z with | z | = 1 — (1/nJ. [Hint: For such z the mth term in the series defining h{z ) 
is much larger than the sum of all the others.] 

Hence h has no finite radial limits. 

Prove also that h must have infinitely many zeros in U. (Compare with Exercise 15, Chap. 12.) 
In fact, prove that to every complex number a there correspond infinitely many z e U at which 
h{z ) = a. 

7 Show that in Theorem 13.9 we need not assume that A intersects each component of S 2 — D. It is 
enough to assume that the closure of A intersects each component of S 2 — D. 

8 Prove the Mittag-LefTler theorem for the case in which D is the whole plane, by a direct argument 
which makes no appeal to Runge’s theorem. 

9 Suppose D is a simply connected region, / e H(Q), f has no zero in Q, and n is a positive integer. 
Prove that there exists a g e H(Q) such that g n =f 
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10 Suppose ft is a region, / e H(Q), and / # 0. Prove that / has a holomorphic logarithm in ft if and 
only if / has holomorphic nth roots in ft for every positive integer n. 

11 Suppose that f„ e H(G) ( n = 1, 2, 3, ...),/ is a complex function in ft, and /(z) = lim,,.^ /„(z ) for 
every zed. Prove that ft has a dense open subset V on which / is holomorphic. Hint : Put 
(p — sup \f n \. Use Baire’s theorem to prove that every disc in ft contains a disc on which (p is 
bounded. Apply Exercise 5, Chap. 10. (In general, V # ft. Compare Exercises 3 and 4.) 

12 Suppose, however, that / is any complex- valued measurable function defined in the complex plane, 
and prove that there is a sequence of holomorphic polynomials P n such that lim,,.^ P„(z) = /(z) for 
almost every z (with respect to two-dimensional Lebesgue measure). 



CHAPTER 

FOURTEEN 

CONFORMAL MAPPING 


Preservation of Angles 

14.1 Definition Each complex number z i=- 0 determines a direction from the 
origin, defined by the point 



(1) 


on the unit circle. 

Suppose / is a mapping of a region Q into the plane, z 0 e Q, and z 0 has a 
deleted neighborhood D'(z 0 ; r) c= Q in which /(z) ^ /(z 0 ). We say that / pre- 
serves angles at z 0 if 

lim e~ ie A[f(z 0 + re ie ) -/(z 0 )] (r > 0) (2) 

r-0 

exists and is independent of 6. 

In less precise language, the requirement is that for any two rays L and 
L\ starting at z 0 , the angle which their images / (L) and / (L!) make at / (z 0 ) is 
the same as that made by L and L\ in size as well as in orientation. 

The property of preserving angles at each point of a region is character- 
istic of holomorphic functions whose derivative has no zero in that region. 
This is a corollary of Theorem 14.2 and is the reason for calling holomorphic 
functions with nonvanishing derivative conformal mappings . 

14.2 Theorem Let f map a region Q into the plane. If f'(z 0 ) exists at some 
z 0 e Q and f'(z 0 ) ^ 0, then f preserves angles at z 0 . Conversely , if the differen- 
tial of f exists and is different from 0 at z 0 , and if f preserves angles at z 0 , then 
f'(z 0 ) exists and is different from 0. 
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Here f\z 0 ) = lim [/(z) — /(z 0 )]/(z — z 0 ), as usual. The differential of / at z 0 is 
a linear transformation L of R 2 into R 2 such that, writing z 0 = (x 0 , y 0 \ 

f(x 0 + X, y 0 + y) =/(x 0 , y 0 ) + L(x, y) + (x 2 + y 2 ) ll2 ri(x, y), (1) 

where rj(x , y)— ► 0 as x— ► 0 and y — ► 0, as in Definition 7.22. 

Proof Take z 0 =/(z 0 ) = 0, for simplicity. If /'( 0) = a ^ 0, then it is imme- 
diate that 


e 


w Alf(re ie )-] = 


e- i6 f(re ie ) 

\f(re w )\ 


a 

H 


(r-> 0), 


(2) 


so /preserves angles at 0. Conversely, if the differential of / exists at 0 and is 
different from 0, then (1) can be rewritten in the form 


/(z) = ocz + )3z + \z\ri(z), (3) 

where »/(z)— > 0 as z— > 0, and a and ft are complex numbers, not both 0. If / 
also preserves angles at 0, then 

(4) 

exists and is independent of 0. We may exclude those 0 for which the 
denominator in (4) is 0; there are at most two such 0 in [0, 2n). For all other 
0 , we conclude that a + fie~ 2i0 lies on a fixed ray through 0, and this is 
possible only when /? = 0. Hence a ^ 0, and (3) implies that /'( 0) = a. //// 


Note : No holomorphic function preserves angles at any point where its 
derivative is 0. We omit the easy proof of this. However, the differential of a 
transformation may be 0 at a point where angles are preserved. Example: f(z) = 

I Z | z, z 0 = 0. 


Linear Fractional Transformations 


14.3 If a, b, c, and d are complex numbers such that ad — be ^ 0, the mapping 


az + b 
cz -b d 


(1) 


is called a linear fractional transformation. It is convenient to regard (1) as a 
mapping of the sphere S 2 into S 2 , with the obvious conventions concerning the 
point oo. For instance, —d/c maps to oo and oo maps to a/c, if c ^ 0. It is then 
easy to see that each linear fractional transformation is a one-to-one mapping of 
S 2 onto S 2 . Furthermore, each is obtained by a superposition of transformations 
of the following types: 

(a) Translations: z->z + &. 

(b) Rotations: z— ► az, | a \ = 1. 
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(c) Homotheties: z— ► rz, r > 0. 

(i d ) Inversion : z — ► 1/z. 

If c = 0 in (1), this is obvious. If c ^ 0, it follows from the identity 

az + b a 2 „ be — ad 

~i ~A = “ + > A = . (2) 

cz + a c cz + d c 

The first three types evidently carry lines to lines and circles to circles. This is 
not true of ( d ). But if we let F be the family consisting of all straight lines and all 
circles, then F is preserved by (d\ and hence we have the important result that 
F is preserved by every linear fractional transformation. [It may be noted that 
when F is regarded as a family of subsets of S 2 , then F consists of all circles on 
S 2 , via the stereographic projection 13.1(1); we shall not use this property of F 
and omit its proof.] 

The proof that F is preserved by inversion is quite easy. Elementary analytic 
geometry shows that every member of F is the locus of an equation 

azz -f Pz + fiz + y = 0, (3) 

where a and y are real constants and P is a complex constant, provided that 
PP > a y. If a ^ 0, (3) defines a circle; a = 0 gives the straight lines. Replacement 
of z by 1/z transforms (3) into 

a + P~z + Pz + yzz = 0, (4) 

which is an equation of the same type. 

Suppose a, b, and c are distinct complex numbers. We construct a linear 
fractional transformation cp which maps the ordered triple {a, b 9 c } into 
{0, 1, oo }, namely, 


<p(z) = 


(b — c)(z — a) 
(b — a)(z — c) 


( 5 ) 


There is only one such cp. For if cp(a) = 0, we must have z — a in the numer- 
ator; if cp(c) = oo, we must have z — c in the denominator; and if cp(b) = 1 , we are 
led to (5). If a or b or c is oo, formulas analogous to (5) can easily be written 
down. If we follow (5) by the inverse of a transformation of the same type, we 
obtain the following result : 

For any two ordered triples {a, b, c } and {a\ b', c'} in S 2 there is one and only 
one linear fractional transformation which maps a to a\ b to b\ and c to c'. 

(It is of course assumed that a ^ h, a ^ c, and Me, and likewise for a\ b\ 
and c'.) 

We conclude from this that every circle can be mapped onto every circle by a 
linear fractional transformation. Of more interest is the fact that every circle can 
be mapped onto every straight line (if oo is regarded as part of the line), and 
hence that every open disc can be conformally mapped onto every open half plane. 
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Let us discuss one such mapping more explicitly, namely, 

<p(z) = 7-^- • (6) 

1 — z 

This cp maps { — 1, 0, 1} to {0, 1, 00 } ; the segment (—1, 1) maps onto the positive 
real axis. The unit circle T passes through — 1 and 1; hence cp(T) is a straight line 
through (p(— 1) = 0. Since T makes a right angle with the real axis at —1, (p(T) 
makes a right angle with the real axis at 0. Thus cp(T) is the imaginary axis. Since 
cp( 0) = 1 , it follows that cp is a conformal one-to-one mapping of the open unit disc 
onto the open right half plane. 

The role of linear fractional transformations in the theory of conformal 
mapping is also well illustrated by Theorem 12.6. 


14.4 Linear fractional transformations make it possible to transfer theorems con- 
cerning the behavior of holomorphic functions near straight lines to situations 
where circular arcs occur instead. It will be enough to illustrate the method with 
an informal discussion of the reflection principle. 

Suppose Q is a region in U , bounded in part by an arc L on the unit circle, 
and /is continuous on Q, holomorphic in Q, and real on L. The function 


•A(z) = 


z — i 
z + i 


(i) 


maps the upper half plane onto U. If g =/ ° Theorem 11.14 gives us a holo- 
morphic extension G of g, and then F = G ° ij/ 1 gives a holomorphic extension 
F off which satisfies the relation 

/(z*) = m (2), 


where z* = 1/z. 

The last assertion follows from a property of 1 /^: If w = 1 l/(z) and = 1 j/(z) 9 
then = w*, as is easily verified by computation. 

Exercises 2 to 5 furnish other applications of this technique. 

Normal Families 

The Riemann mapping theorem will be proved by exhibiting the mapping func- 
tion as the solution of a certain extremum problem. The existence of this solution 
depends on a very useful compactness property of certain families of holo- 
morphic functions which we now formulate. 

14.5 Definition Suppose SF c= H( Q), for some region Q. We call 2F a normal 
family if every sequence of members of 2F contains a subsequence which con- 
verges uniformly on compact subsets of Q. The limit function is not required 
to belong to 3F . 
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(Sometimes a wider definition is adopted, by merely requiring that every 
sequence in <F either converges or tends to oo, uniformly on compact subsets of 
Q. This is well adapted for dealing with meromorphic functions.) 

14.6 Theorem Suppose 3F c H(Q) and 3F is uniformly bounded on each 
compact subset of the region Q. Then $F is a normal family. 

Proof The hypothesis means that to each compact K c= Q there corresponds 
a number M(K) < oo such that | f(z) \ < M(K) for all/ e & and all z g K. 

Let {K n } be a sequence of compact sets whose union is Q, such that K n 
lies in the interior of K n + l ; such a sequence was constructed in Theorem 
13.3. Then there exist positive numbers S n such that 

D(z;25 n )^K n+1 (z e K„). (1) 

Consider two points z' and z" in K n , such that | z' — z” \ < <5„ , let y be the 
positively oriented circle with center at z f and radius 2 <5„, and estimate 
| f(z') —f(z") | by the Cauchy formula. Since 

1 1 z' - z" 

C-z'~t-z"~((-z')((-z") 9 


we have 


m -nn = 


z' — z” 
2ni 


m 

« - m - n 


dt, 


(2) 


and since — z'j = 2 S„ and | C — z" \ > S„ for all £ e y*, (2) gives the inequal- 
ity 


| f(z') - f(z ") I < M( ^ n + l) |Z'-Z"|, (3) 

valid for allf e 3F and all z' and z" e K n , provided that | z' — z” | < S n . 

This was the crucial step in the proof : We have proved, for each K n , that 
the restrictions of the members of to K n form an equicontinuous family. 

If f g 3F, for j = 1, 2, 3, ..., Theorem 11.28 implies therefore that there 
are infinite sets S n of positive integers, S t => S 2 => S 3 => • • • , so that {/•} con- 
verges uniformly on K n as j — ► oo within S n . The diagonal process yields then 
an infinite set S such that {/} converges uniformly on every K n (and hence 
on every compact K cz Q) as j — ► oo within S. //// 

The Riemann Mapping Theorem 

14.7 Conformal Equivalence We call two regions and Q 2 conformally equiva- 
lent if there exists a q> g //(OJ such that cp is one-to-one in and such that 
^(QJ = Q 2 , i.e., if there exists a conformal one-to-one mapping of onto Q 2 . 
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Under these conditions, the inverse of (p is holomorphic in Q 2 (Theorem 10.33) 
and hence is a conformal mapping of Q 2 onto ^i- 

It follows that conformally equivalent regions are homeomorphic. But there 
is a much more important relation between conformally equivalent regions: If (p 
is as above, /— ►/ ° (p is a one-to-one mapping of H(Q 2 ) onto //(Q^ which pre- 
serves sums and products, i.e., which is a ring isomorphism of H(Q 2 ) onto H(Q x ). 
If has a simple structure, problems about H(Q 2 ) can be transferred to prob- 
lems in H(Q x) and the solutions can be carried back to H(Q 2 ) with the aid of the 
mapping function cp. The most important case of this is based on the Riemann 
mapping theorem (where Q 2 is the unit disc U\ which reduces the study of H(Q) 
to the study of H(U\ for any simply connected proper subregion of the plane. Of 
course, for explicit solutions of problems, it may be necessary to have rather 
precise information about the mapping function. 

14.8 Theorem Every simply connected region Q in the plane (other than the 
plane itself) is conformally equivalent to the open unit disc U. 


Note: The case of the plane clearly has to be excluded, by Liouville’s 
theorem. Thus the plane is not conformally equivalent to U, although the two 
regions are homeomorphic. 

The only property of simply connected regions which will be used in the 
proof is that every holomorphic function which has no zero in such a region has 
a holomorphic square root there. This will furnish the conclusion “ (J) implies (a) ” 
in Theorem 13.1 1 and will thus complete the proof of that theorem. 

Proof Suppose Q is a simply connected region in the plane and let w 0 be a 
complex number, w 0 £ Q. Let £ be the class of all ij/ e H(Q) which are one- 
to-one in Q and which map Q into U. We have to prove that some ij/ e £ 
maps Q onto U. 

We first prove that £ is not empty. Since Q is simply connected, there 
exists a (p e H(Q) so that cp 2 (z) — z — w 0 in Q. If ^(z^ = cp(z 2 ), then also 
(p 2 (z 1 ) = (p 2 (z 2 ), hence z 1 =z 2 ; thus cp is one-to-one. The same argument 
shows that there are no two distinct points z i and z 2 in Q such that <^(z x ) = 
— (p(z 2 ). Since cp is an open mapping, cp(Q) contains a disc D(a ; r), with 
0 < r < \a\. The disc D( — a; r) therefore fails to intersect <p(Q), and if we 
define ij/ = r/(cp 4- a), we see that fel 

The next step consists in showing that if f e I, if ij/(Q) does not cover all 
of U, and if z 0 e Q, then there exists with 

l*Ai'(zo)l > l*A'(zo)l- 

It will be convenient to use the functions (p a defined by 


<?«*(*) = 


z — a 
1 — az 
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For a e U, (p a is a one-to-one mapping of U onto U; its inverse is <p_ a 
(Theorem 12.4). 

Suppose ij/ e £, a e U, and a £ ^(Q). Then <p a o ijj e £, and <p a o ^ has no 
zero in Q; hence there exists a <7 e //(Q) such that g 2 = (p a 0 i/c We see that <7 
is one-to-one (as in the proof that £ ^ 0 ), hence g e £; and if 1 /^ = o < 7 , 
where ft = gf(z 0 ), it follows that 1 ^ eE. With the notation w 2 = s(w), we now 
have 


* A = <P-a o S o g = (p_ a 0 S o (p_ fi o [// v 

Since <Ai(z 0 ) = 0, the chain rule gives 

'/''(Zo) = F'(0)^'i(z o ), 

where F = <p_ a °s°<p_ 0 . We see that F(U) cz U and that F is not one-to- 
one in U. Therefore | F'( 0)| < 1, by the Schwarz lemma (see Sec. 12.5), so that 
| il/'(z 0 ) | < |^'i(z 0 ) | . [Note that iA'( z o) ^ 0, since \j/ is one-to-one in Q.] 

Fix z 0 e Q, and put 


t 1 = sup { | </' , (z 0 ) I : ip e 2 }. 

The foregoing makes it clear that any /iel /or vv/nc/i | h'(z 0 ) | = fy wi// map Q 
onto U. Hence the proof will be completed as soon as we prove the existence 
of such an h. 

Since 1 1 j/(z) \ < 1 for all ij/ e £ and zeQ, Theorem 14.6 shows that £ is a 
normal family. The definition of rj shows that there is a sequence { 1 //„} in £ 
such that | il/' n (z 0 ) | — > rj , and by normality of £ we can extract a subsequence 
(again denoted by for simplicity) which converges, uniformly on 

compact subsets of Q, to a limit h e H(Q . ). By Theorem 10.28, | h'(z 0 ) | = rj . 
Since £ ^ 0, rj > 0, so h is not constant. Since ^„(Q) c= 17, for n = 1, 2, 3, ... , 
we have h(Q) c= U , but the open mapping theorem shows that actually 
/i(Q) cl/. 

So all that remains to be shown is that h is one-to-one. Fix distinct 
points z l and z 2 e Q; put a = h(z x ) and <x n = iA n ( z i) for n = 1, 2, 3, ... ; and let 
D be a closed circular disc in Q with center at z 2 , such that z x $ D and such 
that h — a has no zero on the boundary of D. This is possible, since the zeros 
of h — a have no limit point in Q. The functions 1 j/ n — a„ converge to h — a, 
uniformly on D ; they have no zero in D since they are one-to-one and have a 
zero at z x ; it now follows from Rouche’s theorem that h — a has no zero in 
D ; in particular, /i(z 2 ) ^ /^Zj). 

Thus /i e £, and the proof is complete. I HI 


A more constructive proof is outlined in Exercise 26. 



CONFORMAL MAPPING 285 


14.9 Remarks The preceding proof also shows that h(z 0 ) = 0. For if h(z 0 ) = ft 
and ft 0, then cp p o h e £, and 

I (<P, » h)'(z 0 ) | = | <p'0)h'(z o ) | = > | h'(z 0 ) | . 

It is interesting to observe that although h was obtained by maximizing 
1 1 l/'(z 0 ) | for i// e Z, h also maximizes | f'(z 0 ) | if / is allowed to range over the 
class consisting of all holomorphic mappings of Q into U (not necessarily 
one-to-one). For if / is such a function, then g =f o h~ l maps U into U , 
hence | g'(0) | < 1, with equality holding (by the Schwarz lemma) if and only if 
g is a rotation, so the chain rule gives the following result : 

If f e //(Q),/(Q) c= U , and z 0 e Q, then \f\z 0 ) \ < \h’(z 0 )\. Equality holds 
if and only if f(z) = Xh(z\for some constant X with \X \ = 1. 

The Class 

14.10 Definition is the class of all / e H(U) which are one-to-one in U and 
'which satisfy 

/(o) = o, m = i. (i) 

Thus every / e Sf has a power series expansion 

f(z) = z + f>„ z" (zeU). (2) 

n = 2 

The class Sf is not closed under addition or multiplication, but has many 
other interesting properties. We shall develop only a few of these in this 
section. Theorem 14.15 will be used in the proof of Mergelyan’s theorem, in 
Chap. 20. 

14.11 Example If |a| < 1 and 


W) = 


(i 


a z) 2 


= Ina" 

n = 1 


“I rr n 


then f a e 9*. 

For if / a (z) = / a (w), then (z — w)(l — a 2 zw) = 0, and the second factor is 
not 0 if | z | < 1 and | w | < 1. 

When | a | = l,/ a is called a Koebe function. We leave it as an exercise to 
find the regions fffJ). 

14.12 Theorem (a) Iff e 9, \ a | = 1, and g(z) = a/(az), then g e 9. ( b ) Iff e 9 
there exists a g e 9 such that 

9 2 (z) =/(z 2 ) (z e U). 


(1) 
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Proof (a) is clear. To prove ( b ), write /(z) = z<p(z). Then cp e H(U\ q>( 0) = 1, 
and cp has no zero in U , since / has no zero in U — {0}. Hence there exists an 
h e H(U) with h( 0) = 1, h 2 (z) = <p(z). Put 

g(z) •-= zh(z 2 ) (z e U). (2) 

Then g 2 (z) = z 2 h 2 (z 2 ) = z 2 (p(z 2 ) =f(z 2 \ so that (1) holds. It is clear that 
#(0) = 0 and g'(0) = 1. We have to show that g is one-to-one. 

Suppose z and w e U and g(z) = g(w). Since / is one-to-one, (1) implies 
that z 2 = w 2 . So either z = w (which is what we want to prove) or z = — w. 
In the latter case, (2) shows that g(z) = — g(w); it follows that g(z) = g(w) = 0, 
and since g has no zero in U — {0}, we have z = w = 0. Ill 7 


14.13 Theorem IfF e H(U — {0}), F is one-to-one in U, and 
F(z) = -+ X a » Z " ( 2 6 U)> 

2 n = 0 


( 1 ) 


then 


oo 


Z M I a » I 2 ^ L 


(2) 


This is usually called the area theorem , for reasons which will become apparent in 
the proof. 

Proof The choice of a 0 is clearly irrelevant. So assume a 0 = 0. Neither the 
hypothesis nor the conclusion is affected if we replace F(z) by AF(Az) 
( |A| = 1). So we may assume that a x is real. 

Put U r = {z: \z\ < r}, C r — (z: | z | = r}, and V r = (z: r < \ z \ < 1}, for 
0 < r < 1. Then F(U r ) is a neighborhood of oo (by the open mapping 
theorem, applied to 1/F); the sets F(£/ r ), F(C r ), and F( V r ) are disjoint, since F 
is one-to-one. Write 


F = u + iv, and 


F(z) = - + a t z + (p(z) (z e U ), 
z 


A =- - b a,r, B = - — a ,r. 
r r 


For z = re 10 , we then obtain 


(3) 


(4) 


u = A cos 0 + Re <p and i; = —B sin 6 + Im cp. 


(5) 
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Divide Eqs. (5) by A and B , respectively, square, and add : 

sin 6 


,2 v 2 

a 2 + W ~ 1 + 


n /Re (p\ 2 2 sin 0 T /im <zA 2 

Re<p + (^rJ B~ lm(p + {~r) ■ 


By (3), <p has a zero of order at least 2 at the origin. If we keep account of (4), 
it follows that there exists an rj > 0 such that, for all sufficiently small r, 


w 2 v 2 


A 2+^2 <1+r l r3 ( Z « rei9 )- 


( 6 ) 


This says that F(C r ) is in the interior of the ellipse E r whose semiaxes are 
Ay/l 4- rjr 3 and Bj 14- rjr 3 , and which therefore bounds an area 

nAB(\ 4- rjr 3 ) = 4- a i r ^~ — a i r ^(! + ^ 3 ) < ^ (1 4- fyr 3 ). (7) 

Since F(C r ) is in the interior of E ri we have E r c= F(U r ); hence F(V r ) is in 
the interior of E ri so the area of F(V r ) is no larger than (7). The Cauchy- 
Riemann equations show that the Jacobian of the mapping (x, y) — ► (m, v) is 
| F' | 2 . Theorem 7.26 therefore gives the following result: 


1(1 + ^ 3 ) 


■f-r- 

- 2 ”f 


t 2 e 2,8 + y t n l e ,(n 1)8 1 2 d() 


(t 3 + X n 2 1 a„ | 2 t 2 " *) df 


= n{r 2 - 1 + £ n | a„ | 2 (1 - r 2 ")}. 
i 

If we divide (8) by n and then subtract r~ 2 from each side, we obtain 


(8) 


Z « I l 2 (l - r 2n ) < 1+ r\r 

n = 1 


( 9 ) 


for all sufficiently small r and for all positive integers N. Let r— ► 0 in (9), then 
let N-* oo. This gives (2). //// 


Corollary Under the same hypothesis , | a x | < 1. 

That this is in fact best possible is shown by F(z) = (1 /z) + az, |a| = 1, 
which is one-to-one in U. 
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14.14 Theorem Iff e 9, and 

f(z) = Z + X a n 

n — 2 

then (a) \ a 2 \ < 2, and (b) f(U) => D( 0; i). 

The second assertion is that f(U) contains all w with | w | < 4. 

Proof By Theorem 14.12, there exists a g e 9 so that g 2 (z) =f(z 2 ). If 
G = l/g, then Theorem 14.13 applies to G, and this will give (a). Since 

/(z 2 ) = z 2 (l + a 2 z 2 + •••), 

we have 

g(z) = z( 1 + \a 2 z 2 + • • •), 

and hence 


G(z) = - (1 - \a 2 z 2 + • • •) = - - ^ z + 
z z z 


The Corollary to Theorem 14.13 shows now that | a 2 \ < 2. 
To prove ( b ), suppose w £ /(G). Define 


h{z) = 


/(z) 


1 -/(z)/w 

Then ft e ff(G), h is one-to-one in G, and 


/i(z) = (z + a 2 z 2 + • • *)^1 + — + * * = z + ^a 2 + ^jz 2 + 


so that he 9. Apply (a) to h: We have | a 2 + (1/w) | < 2, and since | a 2 \ < 2, 
we finally obtain 1 1/w | < 4. So | w | > £ for every w £/(G). 

This completes the proof. //// 


Example 14.11 shows that both (a) and ( b ) are best possible. 

Moreover, given any a ^ 0, one can find entire functions /, with /( 0) = 0, 
/'( 0) = 1, that omit the value a. For example, 

/(z) = a(l - IT*'*). 

Of course, no such / can be one-to-one in G if | a | < i. 


14.15 Theorem Suppose F e H(U — {0}), F is one-to-one in G, F has a pole of 
order 1 at z = 0, with residue 1, and neither vv x nor w 2 are in F(G). 


Then | w x — w 2 | < 4. 
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Proof If / = 1/(F — wj, then / e hence f(U) => D( 0, *), so the image of U 
under F — w x contains all w with | w | > 4. Since w 2 — w x is not in this image, 
we have | w 2 — w x | < 4. //// 

Note that this too is best possible: If F(z) = z _1 + z, then F(U) does not 
contain the points 2, —2. In fact, the complement of F(U) is precisely the interval 
[ — 2, 2] on the real axis. 


Continuity at the Boundary 

Under certain conditions, every conformal mapping of a simply connected region 
Q onto U can be extended to a homeomorphism of its closure Q onto £7. The 
nature of the boundary of Q plays a decisive role here. 

14.16 Definition A boundary point P of a simply connected plane region Q 
will be called a simple boundary point ofQ if P has the following property: To 
every sequence {a„} in Q such that ot n — ► /? as n— > oo there corresponds a 
curve y, with parameter interval [0, 1], and a sequence {t„}, 0 < t l < t 2 < 

• , 1, such that y(t n ) = ot n (n = 1, 2, 3, . . .) and y(t) e Q for 0 < t < 1. 

In other words, there is a curve in Q which passes through the points a„ 
and which ends at /?. 


14.17 Examples Since examples of simple boundary points are obvious, let 
us look at some that are not simple. 

If Q is U — {x: 0 < x < 1}, then Q is simply connected; and if 0 < /? < 1, 
P is a boundary point of Q which is not simple. 

To get a more complicated example, let Q 0 be the interior of the square 
with vertices at the points 0, 1, 1 + i, and i. Remove the intervals 


[i’S + V'] “ d [sr+T + 5T+T + '] 

from Q 0 . The resulting region Q is simply connected. If 0 < y < 1, then iy is 
a boundary point which is not simple. 


14.18 Theorem Let Q be a bounded simply connected region in the plane , and 
let fbe a conformal mapping of Q onto U. 

(a) If P is a simple boundary point of Q, then f has a continuous extension to 
Q u {P}. Iff is so extended , then \f(P) \ = 1. 

(b) If Pi and P 2 are distinct simple boundary points of Q and if f is extended to 
Q u {pi} u {p 2 } as in (a), thenf(pi) ^ f(P 2 ). 

Proof Let g be the inverse of f Then g e H{JJ\ by Theorem 10.33, g(U) — Q, 
g is one-to-one, and g e H 00 , since Q is bounded. 
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Suppose (a) is false. Then there is a sequence {a„} in ft such that a„— ► ft 
/( a 2 n)— ► /( a 2 n + i)^ w 2 , and vv x 9 * vv 2 . Choose 7 as in Definition 14.16, 

and put T(r) = /(y(r)), for 0 < t < 1. Put K r = g(D( 0; r)), for 0 < r < 1. Then 
K r is a compact subset of Q. Since y(r)— ► f as r — ► 1, there exists a r* < 1 
(depending on r) such that y(t) $ K r if t* < t < 1. Thus | T(r) | > r if 
t* < t < 1. This says that | F(r) | — ► 1 as t—> 1. Since r(r 2n )— ► vv x and 
r(r 2 „ + 1 )-> w 2 , we also have I w x | = | w 2 | = 1 . 

It now follows that one of the two open arcs J whose union is 
T — ({Wi} u {w 2 }) has the property that every radius of U which ends at a 
point of J intersects the range of T in a set which has a limit point on T. 
Note that g(T(t)) = y(t) for 0 < t < 1 and that g has radial limits a.e. on T, 
since g e H°°. Hence 

hm 0 (re*) = /? (a.e. on J), (1) 

r- 1 

since 0 (T(r))— ► /? as t—> 1. By Theorem 11.32, applied to g — /?, (1) shows that 
g is constant. But g is one-to-one in U , and we have a contradiction. Thus 
= w 2 , and (a) is proved. 

Suppose ( b ) is false. If we multiply / by a suitable constant of absolute 
value 1 , we then have ^ but f(P x ) = f(fi 2 ) = 1 . 

Since ^ and fi 2 are simple boundary points of Q, there are curves y t with 
parameter interval [ 0 , 1 ] such that y f ([ 0 , 1 )) c= Q for i = 1 and 2 and y f (l) = 
ft. Put r i (t)=f(y i (t)). Then ^([ 0 , 1 )) c = [/, and T^l) = T 2 (l) = 1. Since 
0 (T,(O) = 7/(0 on [ 0 , 1 ), we have 


lim^(T I .(0) = ft O’ = 1 , 2 ). ( 2 ) 

r-l 

Theorem 12.10 implies therefore that the radial limit of g at 1 is ft as well as 
ft . This is impossible if ft i=- ft . //// 

14.19 Theorem // Q is a bounded simply connected region in the plane and if 
every boundary point of Q is simple , then every conformal mapping of Q onto U 
extends to a homeomorphism of Q onto U. 


Proof Suppose / e H(Q),f(Q) — U , and / is one-to-one. By Theorem 14.18 
we can extend / to a mapping of Q into U such that /(a„)^/(z) whenever 
{a„} is a sequence in Q which converges to z. If {z n } is a sequence in Q which 
converges to z, there exist points a n eQ such that | a„ — z„ | < 1 /n and 
|/(a„) — /(z„)| < 1/n. Thus a >z, hence /(a„) — >/(z), and this shows that 
/(z„)->/(z). 

We have now proved that our extension of / is continuous on Q. Also 
U <=f (ft) U. The compactness of U implies that / (fi) is compact. Hence 

m) = u . 
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Theorem 14.18(h) shows that /is one-to-one on Q. Since every contin- 
uous one-to-one mapping of a compact set has a continuous inverse ([26], 
Theorem 4.17), the proof is complete. //// 


14.20 Remarks 

(a) The preceding theorem has a purely topological corollary : If every bound- 
ary point of a bounded simply connected plane region Q is simple , then the 
boundary of Q is a Jordan curve , and Q is homeomorphic to U. 

(A Jordan curve is, by definition, a homeomorphic image of the unit 
circle.) 

The converse is true, but we shall not prove it: If the boundary of Q 
is a Jordan curve, then every boundary point of Q is simple. 

(b) Suppose /is as in Theorem 14.19, a, h, and c are distinct boundary points 
of Q, and A, B , and C are distinct points of T. There is a linear fractional 
transformation (p which maps the triple {/(«), /(h), /(c)} to { A , B , C}; 
suppose the orientation of {^4, B , C} agrees with that of {/(«), /(h), /(c)}; 
then cp(U) = U , and the function g = (p o f is a homeomorphism of Q 
onto U which is holomorphic in Q and which maps (a, h, c} to pre- 
scribed values { A , B , C}. It follows from Sec. 14.3 that g is uniquely 
determined by these requirements. 

(c) Theorem 14.19, as well as the above remark (h), extends without difficulty 
to simply connected regions Q in the Riemann sphere S 2 , all of whose 
boundary points are simple, provided that S 2 — Q has a nonempty inte- 
rior, for then a linear fractional transformation brings us back to the case 
in which Q is a bounded region in the plane. Likewise, U can be 
replaced, for instance, by a half plane. 

(d) More generally, if f x and f 2 map and Q 2 onto 17, as in Theorem 14.19, 
then f = f 2 1 °/i is a homeomorphism of onto Q 2 which is holo- 
morphic in Qj. 

Conformal Mapping of an Annulus 

14.21 It is a consequence of the Riemann mapping theorem that any two simply 
connected proper subregions of the plane are conformally equivalent, since each 
of them is conformally equivalent to the unit disc. This is a very special property 
of simply connected regions. One may ask whether it extends to the next simplest 
situation, i.e., whether any two annuli are conformally equivalent. The answer is 
negative. 

For 0 < r < R, let 


A(r, R) = {z:r<\z\<R} (1) 

be the annulus with inner radius r and outer radius R. If k > 0, the mapping 
z— ► kz maps A(r, R) onto A(2r, 2R). Hence A(r , R) and A(r l9 Ri) are conformally 
equivalent if R/r = RJr v The surprising fact is that this sufficient condition is 
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also necessary; thus among the annuli there is a different conformal type associ- 
ated with each real number greater than 1. 

14.22 Theorem A(r l9 Ry) and A(r 2 , R 2 ) are conformally equivalent if and only 
ifKi/ri = R 2 /r z . 

Proof Assume r x = r 2 = 1, without loss of generality. Put 

Ay = >1(1, Ry), A 2 = A(l, R 2 ) 9 (1) 

and assume there exists / e H(Ay) such that /is one-to-one and f(Ay)- A 2 . 
Let K be the circle with center at 0 and radius r = >//^ . Since / “ 1 : A 2 — ► Ay 
is also holomorphic 9 f~ l (K) is compact. Hence 

A(l 9 l+e)nf~ 1 (K) = 0 ( 2 ) 

for some e > 0. Then V = f(A(l 9 1 + e)) is a connected subset of A 2 which 
does not intersect K , so that V a A(l, r) or V c= A(r 9 R 2 ). In the latter case, 
replace /by R 2 /f So we can assume that V a A (\ 9 r). If 1 <\z n \ < 1 +6 and 
|zj— ► 1, then f(z n ) e V and {/(z„)} has no limit point in A 2 (since / _1 is 
continuous); thus |/(z„)|— ► 1. In the same manner we see that |/(zj|— ► R 2 if 

Now define 


log R 2 
log R 


and 


u(z) = 2 log | /(z) | - 2a log | z | (z e A x ). (4) 

Let ^ be one of the Cauchy-Riemann operators. Since df = 0 and 5/ = /', the 
chain rule gives 


^(2 1og|/|) = ^(log(j/))=/7/, (5) 

so that 

(du)(z) = - - (ze^i). (6) 

/(z) z 

Thus u is a harmonic function in A y which, by the first paragraph of this 
proof, extends to a continuous function on A l which is 0 on the boundary of 
A v Since nonconstant harmonic functions have no local maxima or minima, 
we conclude that u — 0. Thus 

/'(z) _ « 

/(z) Z 


(z 6 Ay). 


( 7 ) 
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Put y(t) = y/Rle u ( — n<t<n); put T =/ o y. As in the proof of 
Theorem 10.43, (7) gives 

“ - 1 - Indr <0) - (8) 

Thus a is an integer. By (3), a > 0. By (7), the derivative of z~*f(z) is 0 in A v 
Thus f(z) = cz a . Since /is one-to-one in A l9 a = 1. Hence R 2 = R //// 


Exercises 


1 Find necessary and sufficient conditions which the complex numbers a, b , c, and d have to satisfy 
so that the linear fractional transformation z — ► (az 4- b)/(cz + d) maps the upper half plane onto itself. 

2 In Theorem 11.14 the hypotheses were, in simplified form, that Q c: n + , L is on the real axis, and 
Im/(z)— ► 0 as z— ► L. Use this theorem to establish analogous reflection theorems under the following 
hypotheses: 

(u) ft c: n + , L on real axis, |/(z)|— ► 1 as z — > L. 

(b) Qc[/,Lc T, | f (z) | — * 1 as z — L. 

(c) QczU,Lcz T, Im/(z) — 0 as z-> L. 

In case (h), if / has a zero at a e ft, show that its extension has a pole at 1/a. What are the 
analogues of this in cases (a) and (c)? 

3 Suppose R is a rational function such that | R(z) | = 1 if | z | = 1. Prove that 


k 


R(z) = cz m n 

n= 1 


Z — cc n 
1 - 0 „z 


where c is a constant, m is an integer, and a l5 ..., a fc are complex numbers such that a„ # 0 and 
| a„ | # 1. Note that each of the above factors has absolute value 1 if | z | = 1. 

4 Obtain an analogous description of those rational functions which are positive on T. 

Hint : Such a function must have the same number of zeros as poles in U. Consider products of 
factors of the form 


(z — aXl — az) 

{z-m-fo 

where | a | < 1 and | \ < 1. 

5 Suppose / is a trigonometric polynomial, 

m= t a k e iM , 

k — ~n 

and f{6) > 0 for all real 0. Prove that there is a polynomial P(z) = c 0 + c x z + • • • + c n z" such that 

/(0)=|P(e*)| 2 {6 real). 

Hint : Apply Exercise 4 to the rational function Su fc z*. Is the result still valid if we assume f{0) > 0 
instead of f(0) > 0? 

6 Find the fixed points of the mappings q> a (Definition 12.3). Is there a straight line which (p a maps to 
itself? 

7 Find all complex numbers a for which f a is one-to-one in U, where 


Describe f a (U) for all these cases. 
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8 Suppose /(z) = z 4- (1/z). Describe the families of ellipses and hyperbolas onto which / maps circles 
with center at 0 and rays through 0. 

9 ( a ) Suppose £2 = {z: — 1 < Re z< 1}. Find an explicit formula for the one-to-one conformal 
mapping/ of £2 onto U for which /(0) = 0 and /'( 0) > 0. Compute /'( 0). 

(b) Note that the inverse of the function constructed in ( a ) has its real part bounded in U , 
whereas its imaginary part is unbounded. Show that this implies the existence of a continuous real 
function u on U which is harmonic in U and whose harmonic conjugate v is unbounded in U. [t; is 
the function which makes u + iv holomorphic in U ; we can determine v uniquely by the requirement 
*0) = 0.] 

(c) Suppose g e H(U), | Re g \ < 1 in JJ t and #(0) = 0. Prove that 

\g(re la ) \ < - log 7-^-. 

n 1 — r 


Hint : See Exercise 10. 

(d) Let £2 be the strip that occurs in Theorem 12.9. Fix a point a + ifi in £2. Let h be a conformal 
one-to-one mapping of £2 onto £2 that carries a + ifi to 0. Prove that 

| h'(<x + if}) | = 1/cos /?. 

10 Suppose / and g are holomorphic mappings of U into £2, /is one-to-one, /((/) = £2, and /( 0) = 0(0). 
Prove that 


9 (D( 0; r)) c=/(D( 0; r)) (0 < r < 1). 

11 Let £2 be the upper half of the unit disc U. Find the conformal mapping/ of £2 onto U that carries 
{ — 1, 0, 1} to { — 1, — i, 1}. Find z e £2 such that /(z) = 0. Find /(i/2). Hint : f = (p ° s ° where (p and 
ij/ are linear fractional transformations and s(>l) = X 2 . 

12 Suppose £2 is a convex region,/ e //(£ 2), and Re f\z) > 0 for all z e £2. Prove that /is one-to-one in 
£2. Is the result changed if the hypothesis is weakened to Re f\z) > 0? (Exclude the trivial case 
/ = constant.) Show by an example that “convex” cannot be replaced by “simply connected.” 

13 Suppose £2 is a region,/, e H{i 2) for n = 1, 2, 3, ... , each /„ is one-to-one in £2, and /„— >f uniformly 
on compact subsets of £2. Prove that / is either constant or one-to-one in £2. Show that both cases can 
occur. 

14 Suppose £2 = {x + iy: — 1 < y < 1},/ e H(£ 2), \f \ < 1, and fix)— > 0 as x— ► 00. Prove that 

lim f(x + iy) = 0 (— 1 < y < 1) 

X-»QO 

and that the passage to the limit is uniform if y is confined to an interval [ — a, a], where a < 1. Hint : 
Consider the sequence {/„}, where /„(z) = z + n, in the square | x | < 1, | y \ < 1. 

What does this theorem tell about the behavior of a function g e H 00 near a boundary point of 
U at which the radial limit of g exists? 

15 Let be the class of all / e H{U) such that Re / > 0 and /( 0) = 1. Show that is a normal 
family. Can the condition “/(0) = 1” be omitted? Can it be replaced by “ |/(0) | < 1”? 

16 Let 3F be the class of all / e H{U) for which 

jj l/(z)l 2 dx dy< 1. 

V 


Is this a normal family? 

17 Suppose £2 is a region,/, e H(£2) for n = 1, 2, 3, ... ,/„—►/ uniformly on compact subsets of £2, and 
/ is one-to-one in £2. Does it follow that to each compact Kc£2 there corresponds an integer N(K) 
such that f n is one-to-one on K for all n > N(K)? Give proof or counterexample. 
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18 Suppose Q is a simply connected region, z 0 e Q, and / and g are one-to-one conformal mappings 
of Q onto U which carry z 0 to 0. What relation exists between / and g ? Answer the same question if 
/ (z 0 ) = #( z o) = a > for some a e U. 

19 Find a homeomorphism of U onto U which cannot be extended to a continuous function on U. 

20 If f e (Definition 14.10) and n is a positive integer, prove that there exists a g e Sf such that 
g\z) =/(z") for all z e 17. 

21 Find all/e ^ such that (a)/(l/) => 17, (6)/(l7) 3 L7, (c) | a 2 | = 2. 

22 Suppose / is a one-to-one conformal mapping of U onto a square with center at 0, and / (0) = 0. 
Prove that f(iz) = if{z). If /(z) = Ec„z", prove that c n = 0 unless n — 1 is a multiple of 4. Generalize 
this : Replace the square by other simply connected regions with rotational symmetry. 

23 Let Q be a bounded region whose boundary consists of two nonintersecting circles. Prove that 
there is a one-to-one conformal mapping of Q onto an annulus. (This is true for every region Q such 
that S 2 — Q has exactly two components, each of which contains more than one point, but this 
general situation is harder to handle.) 

24 Complete the details in the following proof of Theorem 14.22. Suppose 1 < R 2 < and / is a 
one-to-one conformal mapping of A( 1, R t ) onto A( 1, R 2 ). Define f x = f and f n = / o f n _ v Then a sub- 
sequence of {/„} converges uniformly on compact subsets of 4(1, R t ) to a function g. Show that the 
range of g cannot contain any nonempty open set (by the three-circle theorem, for instance). On the 
other hand, show that g cannot be constant on the circle {z: | z | 2 = Rj. Hence / cannot exist. 

25 Here is yet another proof of Theorem 14.22. If / is as in 14.22, repeated use of the reflection 
principle extends / to an entire function such that | f(z) \ = 1 whenever | z | = 1. This implies /(z) = az", 
where | a | = 1 and n is an integer. Complete the details. 

26 Iteration of Step 2 in the proof of Theorem 14.8 leads to a proof (due to Koebe) of the Riemann 
mapping theorem which is constructive in the sense that it makes no appeal to the theory of normal 
families and so does not depend on the existence of some unspecified subsequence. For the final step 
of the proof it is convenient to assume that Q has property ( h ) of Theorem 13.11. Then any region 
conformally equivalent to Q will satisfy (/z). Recall also that ( h ) implies (/'), trivially. 

By Step 1 in Theorem 14.8 we may assume, without loss of generality, that 0 e Q, Q c U, and 
Q t* U. Put Q = D 0 . The proof consists in the construction of regions Q 2 , D 3 , ... and of func- 
tions f x ,f 2 ,/ 3 , . . . , so that / n (Q n - 1 ) = and so that the functions f n °f„- l ° * • • °f 2 0 f\ converge to a 
conformal mapping of Q onto U. 

Complete the details in the following outline. 

{a) Suppose Q„_! is constructed, let r n be the largest number such that D(0; r n ) czCl n _ u let a„ be 
a boundary point of t with | a„| = r„ , choose (3 n so that /? 2 = — a„ , and put 

F n = (p_ an o s o (p_ Pn . 

(The notation is as in the proof of Theorem 14.8.) Show that F n has a holomorphic inverse G n in Q„_ 1} 
and put /„ = A„G„, where X n = |c|/c and c = G^(0). (This f n is the Koebe mapping associated with 
Q„_!. Note that f n is an elementary function. It involves only two linear fractional transformations 
and a square root.) 

{b) Compute that f'JO) = (1 + r t )/l y /7 n > 1. 

(c) Put \j/ 0 (z) = z and i l/ n {z) = /„(^„_i(z)). Show that i J/ n is a one-to-one mapping of Q onto a 
region Q„ c: U, that {i J/'JlQ)} is bounded, that 


n 


wo) = n 

k — 1 


1 + h 


and that therefore r„— ► 1 as n— > oo. 

(i d ) Write \f/ n {z) = zh n {z ), for z e Q, show that \ h n \ <\h n+l \, apply Harnack’s theorem and Exer- 
cise 8 of Chap. 1 1 to {log | h n \ } to prove that {i l/ n } converges uniformly on compact subsets of Q, and 
show that lim if/ n is a one-to-one mapping of Q onto U. 
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27 Prove that x (1 — r n ) 2 < oo, where {r„} is the sequence which occurs in Exercise 26. Hint: 

[±r (1 - y/rf 

2 Jr ' 

28 Suppose that in Exercise 26 we choose a„ e U — Q n _ t without insisting that |aj = r„. For 
example, insist only that 


Kl< 


Ltln 

2 


Will the resulting sequence {ij/ n } still converge to the desired mapping function? 
29 Suppose Q is a bounded region, a e Q ,/ e H(Q),/(Q) c: Q, and f(a ) = a. 

{a) Put fi =f and /„ = / ° /„_ j, compute f' n {a), and conclude that | f\a) \ < 1. 
(b) If f\a) = 1, prove that f(z) = z for all z e Q. Hint: If 


/(z) = z + cjz -a) m + ■■■, 

compute the coefficient of (z — a) m in the expansion of f n {z). 

( c ) If | f'{a) | = 1, prove that /is one-to-one and that f{Q) = Q. 

Hint: If y =f\a\ there are integers n k — ► oo such that y nk — > 1 and f Hi —* g- Then g\a) = 1, 
g{G) c= Q (by Exercise 20, Chap. 10), hence g{z) = z, by part ( b ). Use g to draw the desired conclusions 
about / 

30 Let A be the set of all linear fractional transformations. 

If {a, p, y, <5} is an ordered quadruple of distinct complex numbers, its cross ratio is defined to be 


[*, ft, y, < 5 ] 


(a - - <*) 

(a - axy - /?)' 


If one of these numbers is oo, the definition is modified in the obvious way, by continuity. The same 
applies if a coincides with p or y or 3. 

{a) If q>(z) = [z, a, /?, y], show that (p e A and q> maps {a, p, y} to {0, 1, oo}. 

( b ) Show that the equation [w, a, b, c ] = [z, a, P, y] can be solved in the form w = <p(z ); then 
(p e A maps {a, p, y} to {a, b, c}. 

(c) If (p e A, show that 


O(a), ip(fl), cp(y), <p(, 5)] = [a, [i, y, 8], 

(i d ) Show that [a, p, y, is real if and only if the four points lie on the same circle or straight 

line. 

(i e ) Two points z and z* are said to be symmetric with respect to the circle (or straight line) C 
through a, /?, and y if [z*, a, P, y] is the complex conjugate of [z, a, P, y]. If C is the unit circle, find a 
simple geometric relation between z and z*. Do the same if C is a straight line. 

(/) Suppose z and z* are symmetric with respect to C. Show that <p{z) and (p(z *) are symmetric 
with respect to (p(C), for every (p e A. 

31 ( a ) Show that A (see Exercise 30) is a group, with composition as group operation. That is, if 

(p e A and ip e A, show that (p ° \p e A and that the inverse (p ~ 1 of <p is in A. Show that A is not 

commutative. 

(b) Show that each member of A (other than the identity mapping) has either one or two fixed 

points on S 2 . [A fixed point of (p is a point a such that (p{ a) = a.] 

(c) Call two mappings (p and <p, e A conjugate if there exists a \p e A such that (p l =\p~ l ° (p ° \p. 

Prove that every (p e A with a unique fixed point is conjugate to the mapping z— ► z + 1. Prove that 
every (p e A with two distinct fixed points is conjugate to the mapping z — ► az, where a is a complex 
number; to what extent is a determined by <p? 
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(d) Let a be a complex number. Show that to every (p e A which has a for its unique fixed point 
there corresponds a j 3 such that 

-TJ— = — +/?• 

<jp(z) — a z — a 

Let G a be the set of all these (p, plus the identity transformation. Prove that G a is a subgroup of A and 
that G a is isomorphic to the additive group of all complex numbers. 

(e) Let a and j 6 be distinct complex numbers, and let G a be the set of all (p e A which have a 
and p as fixed points. Show that every (p e G a p is given by 

(p(z) — a z — a 
<p(z)-/? y Z-p’ 

where y is a complex number. Show th^t G a fi is a subgroup of A which is isomorphic to the multipli- 
cative group of all nonzero complex numbers. 

(/) If (p is as in ( d) or ( e ), for which circles C is it true that (p{C) = C? The answer should be in 
terms of the parameters a, (3 , and y. 

32 For z e £7, z 2 ^ 1, define 


/(z) = exp 



choosing the branch of log that has log 1=0. Describe / (£) if E is 

(a) U, 

(b) the upper half of T, 

(c) the lower half of T, 

{d) any circular arc (in U) from — 1 to 1, 

( e ) the radius [0, 1), 

(/) any disc {z: | z — r\< 1 — r}, 0 < r < 1. 

(g) any curve in U tending to 1. 

33 If (p a is as in Definition 12.3, show that 

(a) - f \(p' a \ 2 dm = 1, 

* Jv 

(b) ~ f \(p' a \ dm = — -- log - — l — I . 

nj v |a | 2 l-|a | 2 

Here m denotes Lebesgue measure in R 2 . 



CHAPTER 

FIFTEEN 


ZEROS OF HOLOMORPHIC 
FUNCTIONS 


Infinite Products 

15.1 So far we have met only one result concerning the zero set Z(f) of a non- 
constant holomorphic function / in a region Q, namely, Z(f) has ilo limit point in 
Q. We shall see presently that this is all that can be said about Z(/), if no other 
conditions are imposed on /, because of the theorem of Weierstrass (Theorem 
15.11) which asserts that every A cz Q without limit point in Q is Z(f) for some 
/ e H(Q). If A = {a„}, a natural way to construct such an / is to choose functions 
f n e H(Cl) so that /„ has only one zero, at a„ , and to consider the limit of the 
products 


Pn =fJl •••/„> 

as n— > oo. One has to arrange it so that the sequence {p„} converges to some 
/ e H(Q ) and so that the limit function / is not 0 except at the prescribed points 
a n . It is therefore advisable to begin by studying some general properties of infin- 
ite products. 

15.2 Definition Suppose {u n } is a sequence of complex numbers, 

Pn = (1 + “iXl + « 2 ) ••• (1 + “n)> (!) 

and p — lim„^ 00 p n exists. Then we write 

p = n o + “»)• (2) 

n = 1 
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The p n are the partial products of the infinite product (2). We shall say that the 
infinite product (2) converges if the sequence {p n } converges. 
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In the study of infinite series it is of significance whether the a n 
approach 0 rapidly. Analogously, in the study of infinite products it is of 
interest whether the factors are or are not close to 1. This accounts for the 
above notation: 1 + u n is close to 1 if u n is close to 0. 

15.3 Lemma If u l9 . . . , u N are complex numbers , and if 

P J V=fi( 1 +«n), Pn= n (1 + I M n I )> (1) 

n= 1 n = 1 

then 

Pn rSexpduJ + ••• + |Mjv|) (2) 

and 

I Pn ~ 1| ^ P* ~ !• (3) 

Proof For x > 0, the inequality 1 + x < e x is an immediate consequence of 
the expansion of e x in powers of x. Replace x by | u t | , . . . , | u N | and multiply 
the resulting inequalities. This gives (2). For N = 1, (3) is trivial. The general 
case follows by induction: Forfc=l,...,N — 1, 

Pk + 1 - 1 =Pk( 1 + W/t+l)- 1 = (Pk - 1X1 + U k+ l) + M* + l, 


so that if (3) holds with k in place of N, then also 

|Pfc+l — 1 | < (Pjf — 1)(1 + |M*+il) + |Mfc+ll =Pk + 1 - 1 - //// 


15.4 Theorem Suppose { u n } is a sequence of bounded complex functions on a 
set S , such that Z | u n (s) | converges uniformly on 5. Then the product 

00 

ns) = n (! + «»(s)) a) 

n = 1 

converges uniformly on 5, and f(s 0 ) = 0 at some s 0 e S if and only if u n (s 0 ) = 
— 1 for some n. 

Furthermore , if {n u n l9 n 3 , . . .} is any permutation of {1, 2, 3, . . then we 
also have 


/(s) = n (i + ««(*)) e s). (2) 

fc = i 

Proof The hypothesis implies that X|w„(s)| is bounded on 5, and if p N 
denotes the Nth partial product of (1), we conclude from Lemma 15.3 that 
there is a constant C < oo such that | p N (s) | < C for all N and all s. 
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Choose 6, 0 < 6 < j . There exists an N 0 such that 

Z I U n( S ) I < 6 NS). (3) 

n = N o 

Let {n l9 n 2 , n 3 , . . .} be a permutation of {1, 2, 3, . . If N > N 0 , if M is 
so large that 

{1, 2, N} <= {«!, n 2 , n M }, (4) 

and if q M (s) denotes the Mth partial product of (2), then 

<1m - Pn = Pn{I\ ( l +«J“ U- (5) 

The n k which occur in (5) are all distinct and are larger than N 0 . Therefore 
(3) and Lemma 15.3 show that 

I <1m ~ Pn I < I P N \(e € ~ 1) < 2|p*|€ < 2 Ce. (6) 

If n k = k (k = 1, 2, 3, ...), then q M = p M , and (6) shows that {p N } con- 
verges uniformly to a limit function f Also, (6) shows that 

I Pm ~ Pn 0 I ^ 2 1 p No | e (M > N 0 ), (7) 

so that | p M | > (1 - 26) I p No I . Hence 

| f(s) | > (1 — 2e) | p No (s) | (seS), (8) 

which shows that f(s) = 0 if and only if p No (s) = 0. 

Finally, (6) also shows that {q M } converges to the same limit as {p N }. nn 

15.5 Theorem Suppose 0 < u n < 1. Then 

00 00 
n (!-“«)> 0 \f and onl y f Z«»< 

n - 1 n — 1 

Proof If p N = (1 — w x ) • • • (1 — u N ), then Pi > p 2 ^ ” * , Pn > ^ hence p = 
lim p N exists. If < oo, Theorem 15.4 implies p > 0. On the other hand, 

N 

P ^ Pn = 11 (! - M «) ^ ex P {-«i - «2 «*}> 

1 

and the last expression tends to 0 as N— ► oo, if £w n = oo. //// 

We shall frequently use the following consequence of Theorem 15.4: 

15.6 Theorem Suppose f n e H( Q) for n = 1, 2, 3, . . . , no f n is identically 0 in 
any component of Q, and 

00 

Z |l-/»(z)l 


(1) 
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converges uniformly on compact subsets of Q. Then the product 

00 

m = n m (2) 

n= 1 

converges uniformly on compact subsets of Q. Hence f e H(Q). 

Furthermore , we have 


m(f; z) = z z) (z e fi), (3) 

n= 1 

where m(f ; z) is defined to be the multiplicity of the zero of f at z. [If / (z) ^ 0, 
then m(/ ; z) = 0.] 

Proof The first part follows immediately from Theorem 15.4. For the second 
part, observe that each zeQ has a neighborhood V in which at most finitely 
many of the f n have a zero, by (1). Take these factors first. The product of the 
remaining ones has no zero in V, by Theorem 15.4, and this gives (3). Inci- 
dentally, we see also that at most finitely many terms in the series (3) can be 
positive for any given zeQ. //// 

The Weierstrass Factorization Theorem 

15.7 Definition Put E 0 (z) = 1 — z, and for p = 1, 2, 3, ... , 

£p(z) = (1 - z) exp jz + Z — + • • • + 

These functions, introduced by Weierstrass, are sometimes called elementary 
factors. Their only zero is at z = 1. Their utility depends on the fact that they 
are close to 1 if | z | < 1 and p is large, although E p ( 1) = 0. 


15.8 Lemma For \ z \ < 1 and p — 0, 1,2,..., 

ii-F p (z)i<izr +i . 

Proof For p = 0, this is obvious. For p > 1, direct computation shows that 
- E' p (z) = exp jz + y + • • • + . 

So — E p has a zero of order p at z = 0, and the expansion of — E p in powers 
of z has nonnegative real coefficients. Since 

1 - E p (z) = - f E p (w) dw , 

J[0,z] 
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1 — E p has a zero of order p + 1 atz = 0, and if 


</>(z) = 


1 - EJLz) 
z p+i ’ 


then cp(z) = z", with all a n > 0. Hence | <p(z) | < <p(i) = 1 if | z | < 1, and this 

gives the assertion of the lemma. //// 


15.9 Theorem Let {z„} be a sequence of complex numbers such that z„ ^ 0 and 
| z n | — ► oo as n — > oo. If {p n } is a sequence of nonnegative integers such that 


00 


I 


( r \l+Pn 

y <c ° 


•I* 


for every positive r (where r n = | z n | ), then the infinite product 

m-flEjj) (2) 

«=1 \ Z nJ 

defines an entire function P which has a zero at each point z n and which has no 
other zeros in the plane. 

More precisely , if a occurs m times in the sequence {z„}, then P has a zero 
of order m at a. 

Condition (1 ) is always satisfied if p n — n — l, for instance. 


Proof For every r, r n > 2 r for all but finitely many n , hence r/r n < j for these 
n , so (1) holds with 1 + P„ = n. 

Now fix r. If | z | < r, Lemma 15.8 shows that 



if r n > r, which holds for all but finitely many n. It now follows from (1) that 
the series 


00 


I 



converges uniformly on compact sets in the plane, and Theorem 15.6 gives 
the desired conclusion. //// 


Note: For certain sequences {r„}, (1) holds for a constant sequence {p n }. It is 
of interest to take this constant as small as possible; the resulting function (2) is 
then called the canonical product corresponding to {z„}. For instance, if 
SI /r n < oo, we can take p n = 0, and the canonical product is simply 
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If Zl/r B = 00 but El /r\ < oo, the canonical product is 



Canonical products are of great interest in the study of entire functions of 
finite order. (See Exercise 2 for the definition.) 

We now state the Weierstrass factorization theorem. 

15.10 Theorem Let f be an entire function , suppose /( 0) # 0, and let z l9 z 2 , z 3 , 
...be the zeros of f listed according to their multiplicities. Then there exist an 
entire function g and a sequence { p n } of nonnegative integers , such that 

/(z) =<*<*> n eJ f\ (i) 

n = 1 \ Z nJ 

Note: (a) If / has a zero of order k at z = 0, the preceding applies to f{z)/z k . (b) 
The factorization (1) is not unique; a unique factorization can be associated with 
those / whose zeros satisfy the condition required for the convergence of a canon- 
ical product. 

Proof Let P be the product in Theorem 15.9, formed with the zeros of f. 
Then f/P has only removable singularities in the plane, hence is (or can be 
extended to) an entire function. Also, f/P has no zero, and since the plane is 
simply connected, f/P = e g for some entire function g. //// 

The proof of Theorem 15.9 is easily adapted to any open set: 

15.11 Theorem Let Q be an open set in S 2 , Q # S 2 . Suppose A a Q and A has 
no limit point in Q. With each a e A associate a positive integer m( a). Then 
there exists anfe H(Q) all of whose zeros are in A, and such that f has a zero 
of order m( a) at each aei 

Proof It simplifies the argument, and causes no loss of generality, to assume 
that oo e Q but oo £ A. (If this is not so, a linear fractional transformation 
will make it so.) Then S 2 — Q is a nonempty compact subset of the plane, 
and oo is not a limit point of A. 

If A is finite, we can take a rational function for f. 

If A is infinite, then A is countable (otherwise there would be a limit 
point in Q). Let {a„} be a sequence whose terms are in A and in which each 
a e A is listed precisely m(a) times. Associate with each a„ a point e 
S 2 — Q such that | /?„ — <x n \ < \ f — a n \ for all /? e S 2 — Q; this is possible 
since S 2 — Q is compact. Then 
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as n— ► oo ; otherwise A would have a limit point in Q. We claim that 


00 

m = n En 


n= 1 



has the desired properties. 

Put r n = 2 | <x n — f$ n | . Let Xbea compact subset of Q. Since r„— ► 0, there 
exists an N such that | z — /?„ | > r n for all z e K and all n> N. Hence 


- Pn 1 

z-K ~ 2 ’ 


which implies, by Lemma 15.8, that 



(z e K, n> N), 


and this again completes the proof, by Theorem 15.6. 


//// 


As a consequence, we can now obtain a characterization of meromorphic 
functions (see Definition 10.41): 

15.12 Theorem Every meromorphic function in an open set Q is a quotient of 
two functions which are holomorphic in Q. 


The converse is obvious : If g e H{Q), h e H(Q), and h is not identically 0 in 
any component of Q, then g/h is meromorphic in Q. 

Proof Suppose /is meromorphic in Q; let A be the set of all poles of/ in Q; 
and for each a e A, let m( a) be the order of the pole of / at a. By Theorem 
15.11 there exists an he H(Q) such that h has a zero of multiplicity m( a) at 
each a e A, and h has no other zeros. Put g =fh. The singularities of g at the 
points of A are removable, hence we can extend g so that g e H(Q). Clearly, 
/= g/h in Q — A //// 

An Interpolation Problem 

The Mittag-Leffler theorem may be combined with the Weierstrass theorem 
15.11 to give a solution of the following problem: Can we take an arbitrary set 
A a Q, without limit point in Q, and find a function / e H(Q) which has pre- 
scribed values at every point of A? The answer is affirmative. In fact, we can do 
even better, and also prescribe finitely many derivatives at each point of A : 

15.13 Theorem Suppose Q is an open set in the plane , A a Q, A has no limit 
point in Q, and to each a e A there are associated a nonnegative integer m( a) 
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and complex numbers w„ a , 0 < n < m( a). Then there exists an f e H(Q) such 
that 


/ (n) (a) = n! w n> „ (a e A, 0 < n < m( a)). (1) 


Proof By Theorem 15.11, there exists a g e H(Q) whose only zeros are in A 
and such that g has a zero of order m( a) + 1 at each oc e A. We claim we can 
associate to each a e A a function P a of the form 

1 + m(a) 

PJLz)= I CjJz-*r J (2) 

j= 1 

such that gP a has the power series expansion 

g(z)P a (z) = w 0> , + w u „(z - a) + • • • + w m(ah a (z - + • • • (3) 

in some disc with center at a. 

To simplify the writing, take a = 0 and m( a) = m, and omit the subscripts 
a. For z near 0, we have 


g(z)^b 1 z m+1 +b 2 z m + 2 + ■■■, (4) 

where /), ^ 0. If 

P(z) = c i z~ i + ■■■ + C m+l z~ m ~\ (5) 


then 


g(z)P(z) = (c m + i + c m z + ■■■ + Ciz m ) (b t + b 2 z + b 3 z 2 + ■■■). (6) 

The b' s are given, and we want to choose the c’s so that 

g(z)P(z) = w 0 + w 2 z + ■ ■ ■ + w m z m + • • • . (7) 

If we compare the coefficients of 1, z, ..., z m in (6) and (7), we can solve the 
resulting equations successively for c m + 19 c m , . . . , c l9 since b l j=- 0. 

In this way we obtain the desired P a ’s. The Mittag-Lefller theorem now 
gives us a meromorphic h in Q whose principal parts are these P a ’s, and if we 
put f = gh we obtain a function with the desired properties. //// 

The solution of this interpolation problem can be used to determine the 
structure of all finitely generated ideals in the rings H(Q). 

15.14 Definition The ideal \_g x , ..., g n ] generated by the functions g u ..., g n e 
H(Q) is the set of all functions of the form g t , where f e H(Q) for 
i = 1, ..., n A principal ideal is one that is generated by a single function. 
Note that [1] = H(Q). 

life H(Q), aeQ, and / is not identically 0 in a neighborhood of a, the 
multiplicity of the zero of / at a will be denoted by m(f ; a). If /(a) ^ 0, then 
m(f ; a) = 0, as in Theorem 15.6. 
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15.15 Theorem Every finitely generated ideal in H( Q) is principal 

More explicitly: If g u g n e H( Q), then there exist functions g,f i9 h t e H( Q) 
such that 


9 = Z fi9i and g t = h ( g (1 < i < n). 

i = 1 

Proof We shall assume that Q is a region. This is done to avoid problems 
posed by functions that are identically 0 in some components of Q but not in 
all. Once the theorem is proved for regions, that case can be applied to each 
component of an arbitrary open set Q, and the full theorem can be deduced. 
We leave the details of this as an exercise. 

Let P(n) be the following proposition : 

If g i, . . . , g n e H( Q), if no g t is identically 0, and if no point of Q is a zero of 
every g t , then j _g u . . . , g J = [1]. 

P(l) is trivial. Assume that n > 1 and that P(n — 1) is true. Take g u . .., 
g n eH( Q), without common zero. By the Weierstrass theorem 15.11 there 
exists cp e H( Q) such that 

m(cp; a) = min a): 1 < i < n — 1} (ae Q). (1) 

The functions/; = gjcp (1 < i < n — 1) are in H( Q) and have no common zero 
in Q. Since P(n — 1) is true, [/ x , . x ] = [1]. Hence 


LGu Gn-u Gn\ = 1<P> Gnl ( 2 ) 

Moreover, our choice of cp shows that g n ( a) ^ 0 at every point of the set 
A = {a e Q: cp( a) = 0}. Hence it follows from Theorem 15.13 that there exists 
h e H(Q) such that 


m(l - hg n ; a) > m(cp; a) (a e Q). (3) 

Such an h is obtained by a suitable choice of the prescribed values of h {k \oi) 
for a e /l and for 0 < k < m(cp ; a). 

By (3), (1 — hg n )/cp has removable singularities. Thus 

1 =hg n +f<p (4) 

for some / e H(Q). By ( 2 ) and (4), 1 e [_g u . gj. 

We have shown that P(n — 1) implies P(n). Hence P(n) is true for all n. 
Finally, suppose G u . . . , G n e H(Q), and no G t is identically 0. (This 
involves no loss of generality.) Another application of Theorem 15.11 yields 
cp e H( Q) with m(cp ; a) = min m(G f ; a) for all a e Q. Put g t = Gjcp. Then g { e 
H( Q), and the functions g u ...,g n have no common zeros in Q. By P(n), 
\_g i, . . . , #„] = [1]. Hence [G x , . . . , GJ = [9]. This completes the proof. //// 
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Jensen’s Formula 

15.16 As we see from Theorem 15.11, the location of the zeros of a holomorphic 
function in a region Q is subject to no restriction except the obvious one concern- 
ing the absence of limit points in Q. The situation is quite different if we replace 
H(Q) by certain subclasses which are defined by certain growth conditions. In 
those situations the distribution of the zeros has to satisfy certain quantitative 
conditions. The basis of most of these theorems is Jensen’s formula (Theorem 
15.18). We shall apply it to certain classes of entire functions and to certain sub- 
classes of H(U). 

The following lemma affords an opportunity to apply Cauchy’s theorem to 
the evaluation of a definite integral. 


15.17 Lemma 


2 n 


*2 it 

log |1 d0 = 0. 

Jo 


Proof Let Q = (z: Re z < 1}. Since 1 — z ^ 0 in Q and Q is simply con- 
nected, there exists an h e H(Q) such that 

exp {h(z)} = 1 — z 

in Q, and this h is uniquely determined if we require that h( 0) = 0. Since 
Re (1 — z) > 0 in Q, we then have 

Re h(z) = log 1 1 — z | , | Im h(z) \ < ^ (zeQ). (1) 

For small <5 > 0, let T be the path 

T(t) = (<5 < t < 2n - <5), (2) 

and let y be the circular arc whose center is at 1 and which passes from e id to 
e~ iS within U. Then 


h f *'° 8 11 Re [i I* 2 > t] ” Re [i i*< 2) t] <3) 


The last equality depended on Cauchy’s theorem; note that h(0) = 0. 

The length of y is less than nd , so (1) shows that the absolute value of the 
last integral in (3) is less than Cd log (l/<5), where C is a constant. This gives 
the result if <5— ► 0 in (3). //// 


15.18 Theorem Suppose Q = D( 0; R), / e /( 0)^0, 0 < r < R, and <x l9 

. . . , oc N are the zeros of f in 5(0; r), listed according to their multiplicities. Then 

l/(0)l T«J “ exp {s I -,' 08 l/(re,,)l de \ 


( 1 ) 
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This is known as Jensens formula . The hypothesis /(0) ^ 0 causes no harm in 
applications, for if / has a zero of order k at 0 , the formula can be applied to 

m/2 k . 


Proof Order the points a 7 so that ot u . .., a m are in D( 0; r) and |a m + 1 1 = 
• • • = | <x N | = r. (Of course, we may have m = N or m = 0.) Put 


g(z) =/(z) 


n 


r 2 ~ 

H<*» - z) 


N 


n 



(2) 


Then g e H(D), where D = D(0; r + e) for some 6 > 0, # has no zero in D, 
hence log | g \ is harmonic in D (Theorem 13.12), and so 


log 1 0 ( 0 ) | 


.if' 

2n J-, 


log \g(re i9 )\ dO. 


(3) 


By (2), 


m 

| 0 (O)I = I/(O)| n 7 — r- (4) 

n = 1 I I 

For 1 < n < m, the factors in (2) have absolute value 1 if | z | = r. If oc n = re 10 " 
for m < n < N, it follows that 

log I g(re w ) I = log | f{re ie ) \ — £ log|l-^-«|. (5) 

n = m + 1 

Lemma 15.17 shows therefore that the integral in (3) is unchanged if g is 
replaced by f Comparison with (4) now gives (1). //// 

Jensen’s formula gives rise to an inequality which involves the boundary 
values of bounded holomorphic functions in U (we recall that the class of these 
functions has been denoted by if 00 ): 

15.19 Theorem Iff e H 00 ,/ not identically 0, define 

Hr( f ) = £ I” log I fire' 9 ) \de (0 < r < 1) (1) 

and 


H*(f) = 


2n 


* log | /V s ) I dd 

J ~ 71 


where f* is the radial limit function of f as in Theorem 11.32. Then 


Hff) < g s (f) if 0 < r < s < 1 , 
^Xf) log 1 /( 0 ) I as 0 , 


(2) 

(3) 

(4) 
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and 


H r (f) < n*(f ) if 0 < r < 1. (5) 

Note the following consequence: One can choose r so that /(z) # 0 if | z | = r; 
then n r {f) is finite, and so is ju*(/), by (5). Thus log |/* | e l}(T\ and f*(e ie ) ^ 0 
at almost every point of T. 

Proof There is an integer m > 0 such that f(z) = z m g(z\ g e H 00 , and g( 0) ^ 
0. Apply Jensen’s formula 15.18(1) to g in place of /. Its left side obviously 
cannot decrease if r increases. Thus p r (g) < p s (g) if r < s. Since 

nAf) = dXd) + rn log r. 


we have proved (3). 

Let us now assume, without loss of generality, that |/| < 1. Write f r (e ie ) 
in place of f(re ie ). Then / r -*/(0) as r— ► 0, and a.e. as r^> 1. Since 

log (l/|/ r |) > 0, two applications of Fatou’s lemma, combined with (3), give 
(4) and (5). //// 

15.20 Zeros of Entire Functions Suppose /is an entire function, 

M(r) = sup | f(re ie ) \ (0 < r < oo), (1) 

e 

and n(r) is the number of zeros of / in 5(0 ; r). Assume /( 0) = 1, for simplicity. 
Jensen’s formula gives 

r 1 r« n(2 r) 2 r n(r) 2 r 

M(2r) > exp ]— log \f(2re ie )\ d0\ = FI 1 — 7 ^ fl 1 — 7 ^ 2 " <r) . 

f27C J - n ) it=l I I n=l \^n\ 

if {a n } is the sequence of zeros of f arranged so that | a x | < | a 2 1 < • • • . Hence 

n(r) log 2 < log M(2r). (2) 

Thus the rapidity with which n(r) can increase (i.e., the density of the zeros of 
/) is controlled by the rate of growth of M(r). Suppose, to look at a more specific 
situation, that for large r 


M(r) < exp {;4r*} 

where A and k are given positive numbers. Then (2) leads to 

log n(r) 


lim sup 


log r 


< k. 


( 3 ) 

( 4 ) 


For example, if k is a positive integer and 

f(z) = 1 - e z \ 


( 5 ) 
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then n(r) is about n l kr k , so that 


lim 

r-* oo 


log n(r) 
log r 


This shows that the estimate (4) cannot be improved. 


( 6 ) 


Blaschke Products 

Jensen’s formula makes it possible to determine the precise conditions which the 
zeros of a nonconstant / e f/ 00 must satisfy. 


15.21 Theorem If {a„} is a sequence in U such that oc n ^O and 

£(i -!«„!)< oo. (l) 

n= 1 

if k is a nonnegative integer , and if 


n = 1 1 ~ 




(z e U), 


(2) 


then B e H™, and B has no zeros except at the points a„ (and at the origin, if 
k > 0). 


We call this function B a Blaschke product. Note that some of the a„ may be 
repeated, in which case B has multiple zeros at those points. Note also that each 
factor in (2) has absolute value 1 on T. 

The term “ Blaschke product ” will also be used if there are only finitely many 
factors, and even if there are none, in which case B(z) = 1. 


Proof The nth term in the series 

! a » ~ 2 1°U 

l-a„z a„ 

if | z | < r. Hence Theorem 15.6 shows that B e H(U) and that B has only the 
prescribed zeros. Since each factor in (2) has absolute value less than 1 in U, 
it follows that | B(z) | < 1, and the proof is complete. //// 

15.22 The preceding theorem shows that 

£( 1-|«,|)<00 

n= 1 


z 

n = 1 

<*„ + I «, 1 z 

(1 - a„ z)a„ 


( 1 ) 



ZEROS OF HOLOMORPHIC FUNCTIONS 311 


is a sufficient condition for the existence of an / e H 00 which has only the presc- 
ribed zeros {a„}. This condition also turns out to be necessary: Iff e H 00 and f is 
not identically zero , the zeros of f must satisfy (1). This is a special case of Theorem 
15.23. It is interesting that (1) is a necessary condition in a much larger class of 
functions, which we now describe. 

For any real number t , define log + t = log t if t > 1 and log + t = 0 if t < 1. 
We let N (for Nevanlinna) be the class of all f e H(U) for which 

sup I* log + | f(re ie ) I de < oo. (2) 

0 < r < 1 2ft J- n 

It is clear that H°° c= N. Note that (2) imposes a restriction on the rate of growth 
of | f(z) | as | z | — ► 1, whereas the boundedness of the integrals 

^| n iog| f(re ie )\dO (3) 

imposes no such restriction. For instance, (3) is independent of r if / = e 9 for any 
g e H(U). The point is that (3) can stay small because log |/| assumes large nega- 
tive values as well as large positive ones, whereas log + |/| >0. The class N will 
be discussed further in Chap. 17. 

15.23 Theorem Suppose f e N,f is not identically 0 in U, and a l5 a 2 , a 3 , . . . are 
the zeros of f listed according to their multiplicities . Then 

t (l-|«J)<co. (1) 

n= 1 

(We tacitly assume that / has infinitely many zeros in U. If there are only 
finitely many, the above sum has only finitely many terms, and there is nothing to 
prove. Also, |aj < |a w+1 |.) 

Proof If / has a zero of order m at the origin, and g(z) = z~ m f(z\ then g e N, 
and g has the same zeros as f except at the origin. Hence we may assume, 
without loss of generality, that /( 0) ^ 0. Let n(r) be the number of zeros of / 
in 5(0; r), fix k, and take r < 1 so that n(r) > k. Then Jensen’s formula 

|/(#)| n iij = exp {s [> l/(re " )l "} <2) 


implies that 


i/(0) in 


Kl 


< exp 


| log + \f(re ie )\ d0^. 


(3) 
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Our assumption that f e N is equivalent to the existence of a constant 
C < oo which exceeds the right side of (3) for all r, 0 < r < 1. It follows that 

II KI>C- 1 |/(0)|r\ (4) 

n — 1 

The inequality persists, for every k, as r— > 1. Hence 

n l«.l^c- 1 |/(0)|>0. (5) 

n= 1 

By Theorem 15.5, (5) implies (1). //// 

Corollary If f e //°° (or even if f e N), if cc l9 a 2 , a 3 , . . . are the zeros of f in U , 
and i/Z( 1 — | a„ | ) = oo, then f(z) = 0 for all z e U. 

For instance, no nonconstant bounded holomorphic function in U can have 
a zero at each of the points (n — 1 )/n (n = 1, 2, 3, . . .). 

We conclude this section with a theorem which describes the behavior of a 
Blaschke product near the boundary of U . Recall that as a member of H 00 , B has 
radial limits B*(e ie ) at almost all points of T. 

15.24 Theorem If B is a Blaschke product , then | B*(e id ) | = 1 a.e . and 

lim •— I log | B(re w ) \ d6 = 0. (1) 

r~* 1 27T J- n 


Proof The existence of the limit is a consequence of the fact that the integral 
is a monotonic function of r. Suppose B(z) is as in Theorem 15.21, and put 


B n (z) = 


oo 


n 


1 -5,2 



( 2 ) 


Since log ( | B/B N | ) is continuous in an open set containing T, the limit (1) is 
unchanged if B is replaced by B N . If we apply Theorem 15.19 to B N we 
therefore obtain 


log I B*(0) u lim f I log I B(re ie ) \d6<f\ log | B*(e m ) \dd<0. (3) 

r-1 J — n 27T 

As oo, the first term in (3) tends to 0. This gives (1), and shows that 
J log |£*| = 0. Since log |£*| < 0 a.e., Theorem 1.39(a) now implies that 
log | £* | = 0 a.e. //// 


The Miintz-Szasz Theorem 

15.25 A classical theorem of Weierstrass ([26], Theorem 7.26) states that the 
polynomials are dense in C(J), the space of all continuous complex functions on 
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the closed interval / = [0, 1], with the supremum norm. In other words, the set of 
all finite linear combinations of the functions 

i, t, t 2 , t 3 , ... (i) 

is dense in C(7). This is sometimes expressed by saying that the functions (1) span 
C(I). 

This suggests a question: If 0 < < k 2 < ^3 < * * * > under what conditions is 

it true that the functions 


1, t k \ t x \ ... 


(2) 


span C(/)? 

It turns out that this problem has a very natural connection with the 
problem of the distribution of the zeros of a bounded holomorphic function in a 
half plane (or in a disc; the two are conformally equivalent). The surprisingly neat 
answer is that the functions (2) span C(I) if and only ifL\/k n = 00. 

Actually, the proof gives an even more precise conclusion: 

15.26 Theorem Suppose 0 < k t < X 2 < k 3 < • • • and let X be the closure in 
C(7) of the set of all finite linear combinations of the functions 

1, t Xl 9 t k \ t k \ .... 

(a) IfL\/X n = 00, then X = C(J). 

(b) IfLl/k n < 00, and if X $ {k n } 9 k ^ 0, then X does not contain the function 
t k . 

Proof It is a consequence of the Hahn-Banach theorem (Theorem 5.19) that 
q> e C(I) but cp $ X if and only if there is a bounded linear functional on C(I) 
which does not vanish at q> but which vanishes on all of X. Since every 
bounded linear functional on C(7) is given by integration with respect to a 
complex Borel measure on 7, (a) will be a consequence of the following pro- 
position: 

IfL\/k n — 00 and if p is a complex Borel measure on I such that 

jV" dtft) = 0 {n = 1, 2, 3, ...), (1) 

then also 

JV dn(t) = 0 (*= 1,2,3,...). (2) 

For if this is proved, the preceding remark shows that X contains all 
functions t*; since lei, all polynomials are then in X, and the Weierstrass 
theorem therefore implies that X = C(7). 
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So assume that (1) holds. Since the integrands in (1) and (2) vanish at 0, 
we may as well assume that g is concentrated on (0, 1]. We associate with g 
the function 


m = jV dm (3) 

For t > 0, t z = exp (z log t ), by definition. We claim that / is holo- 
morphic in the right half plane. The continuity of / is easily checked, and we 
can then apply Morera’s theorem. Furthermore, if z = x + iy , if x > 0, and if 
0 < t < 1, then | t z \ = t x < 1. Thus / is bounded in the right half plane, and 
(1) says that f(k n ) = 0, for n = 1, 2, 3, Define 

& ( 2 ) ( z e u )- ( 4 ) 

Then g e H 00 and g(tx n ) = 0, where oc n = (k n — \)/{k n + 1). A simple computa- 
tion shows that Z(1 — |a„|) = 00 if El/A„ = 00 . The Corollary to Theorem 
15.23 therefore tells us that g(z) = 0 for all z e U. Hence / = 0. In particular, 
f(k) = 0 for k = 1, 2, 3, and this is (2). We have thus proved part (a) of 
the theorem. 

To prove ( b ) it will be enough to construct a measure g on I such that (3) 
defines a function / which is holomor phic in the half plane Re z > — 1 
(anything negative would do here), which is 0 at 0, ••• an d which 

has no other zeros in this half plane. For the functional induced by this 
measure g will then vanish on X but will not vanish at any function t x if 
k 7 ^ 0 and k $ {A„}. 

We begin by constructing a function / which has these prescribed zeros, 
and we shall then show that this / can be represented in the form (3). Define 


Since 


/(*) = 


z 

(2 + z ) 3 


n 


2 + k n + z 


k n — z _ 2z + 2 
2 + + z 2 ~h k n ~h z 


( 5 ) 


the infinite product in (5) converges uniformly on every compact set which 
contains none of the points — k n — 2. It follows that /is a meromorphic func- 
tion in the whole plane, with poles at — 2 and —k n — 2, and with zeros at 
0, k l9 k 2 , k 3 , — Also, each factor in the infinite product (5) is less than 1 in 
absolute value if Re z > —1. Thus |/(z)| < 1 if Re z > — 1. The factor 
(2 + z) 3 ensures that the restriction of / to the line Re z = — 1 is in L 1 . 

Fix z so that Re z > —1, and consider the Cauchy formula for /(z), 
where the path of integration consists of the semicircle with center at — 1, 
radius R > 1 + | z | , from — 1 — iR to — 1 + R to — 1 + iR, followed by the 
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interval from — 1 -MR to — 1 — iR. The integral over the semicircle tends to 
0 as R — ► oo, so we are left with 


But 


/(*)» 



/(-l + is) 

— 1 + is — z 


ds 


(Re z > - 1). 


1 

1 + z — is 



(Re z > - 1). 


Hence (6) can be rewritten in the form 


/(*) = 



/(- 1 


+ is)e ” is log ‘ ds 


dt. 


(6) 

(7) 


( 8 ) 


The interchange in the order of integration was legitimate : If the integrand in 
(8) is replaced by its absolute value, a finite integral results. 

Put g(s) =/(— 1 + is). Then the inner integral in (8) is.§(log t) 9 where g is 
the Fourier transform of g. This is a bounded continuous function on (0, 1], 
and if we set dg(t) = g ( log t) dt we obtain a measure which represents / in the 
desired form (3). 

This completes the proof. //// 


15.27 Remark The theorem implies that whenever {1, t x \ t Xl , ...} spans C(I), 
then some infinite subcollection of the t Xi can be removed without altering 
the span. In particular, C(l) contains no minimal spanning sets of this type. 
This is in marked contrast to the behavior of orthonormal sets in a Hilbert 
space : if any element is removed from an orthonormal set, its span is dimin- 
ished. Likewise, if (1, t x \ t Xl ...} does not span C(/), removal of any of its 
elements will diminish the span; this follows from Theorem 15.26(h). 


Exercises 

1 Suppose {a n } and {b n } are sequences of complex numbers such that £|a„ — b n \ < oo. On what sets 
will the product 


converge uniformly? Where will it define a holomorphic function? 

2 Suppose / is entire, X is a positive number, and the inequality 

|/(z)| <exp(|z| A ) 

holds for all large enough | z | . (Such functions / are said to be of finite order. The greatest lower 
bound of all X for which the above condition holds is the order of /.) If / (z) = 2.a n z", prove that the 
inequality 
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holds for all large enough n. Consider the functions exp (z*), k = 1, 2, 3, . . . , to determine whether the 
above bound on | a n | is close to best possible. 

3 Find all complex z for which exp (exp (z)) = 1. Sketch them as points in the plane. Show that there 
is no entire function of finite order which has a zero at each of these points (except, of course,/ = 0). 

4 Show that the function 


n cot nz = ni ■ 


e niz + e — niz 


-7IIZ _ — 71 1 Z 


e — e 

has a simple pole with residue 1 at each integer. The same is true of the function 
_ 1 * 2z £ 1 

/(*) = -+ Z - — 2= hm £ — • 

z n=l z 2 — n 2 N-»oo n=-N Z — n 

Show that both functions are periodic [/(z + 1) =/(z)], that their difference is a bounded entire 
function, hence a constant, and that this constant is actually 0, since 

lim 


J ' 00 dt 

7— 2 = ~ 7 
0 1 + 1 


This gives the partial fractions decomposition 


1 00 2 z 

k cot nz = - + X 3 — ~i • 


(Compare with Exercise 12, Chap. 9.) Note that n cot nz is {g'/g^z) if g{z) = sin nz. Deduce the 
product representation 


sin nz 
nz 




5 Suppose k is a positive integer, {z„} is a sequence of complex numbers such that E | z„ | * 
and 


< oo, 


/w- n e ( j ) 

n= 1 \ Z n/ 

(See Definition 15.7.) What can you say about the rate of growth of 

M(r) = max |/(re*)|? 
e 

6 Suppose /is entire, /(0) #0, |/(z)| < exp (|z| p ) for large |z|, and {z„} is the sequence of zeros of / 
counted according to their multiplicities. Prove that E|z„| -p-< < oo for every € > 0. (Compare with 
Sec. 15.20.) 

7 Suppose /is an entire function, /(^/n) = 0 forn = 1, 2, 3, . . . , and there is a positive coiistant a such 
that | / (z) | < exp (|z| a ) for all large enough |z|. For which a does it follow that /(z) = 0 for all z? 
[Consider sin {nz 2 ).~\ 

8 Let {z„} be a sequence of distinct complex numbers, z„ # 0, such that z„— ► oo as n— ► oo, and let 
{m n } be a sequence of positive integers. Let g be a meromorphic function in the plane, which has a 
simple pole with residue m n at each z„ and which has no other poles. If z £ {z„}, let y(z) be any path 
from 0 to z which passes through none of the points z„ , and define 

/(z) = ex pjf g(0dC 
Uy(z) 
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Prove that /(z) is independent of the choice of y(z) (although the integral itself is not), that /is 
holomorphic in the complement of {z„}, that / has a removable singularity at each of the points z„ , 
and that the extension of / has a zero of order m n at z n . 

The existence theorem contained in Theorem 15.9 can thus be deduced from the Mittag-Leffler 
theorem. 

9 Suppose 0<a<l, 0 < p < 1 ,/ e H(U),f(U) a U, and /( 0) = a. How many zeros can / have in the 

disc D{ 0; /?)? What is the answer if {a) a = j, ft = j; {b) a = *, p = j; (c) a = §, p = (d) a = 1/1,000, 

P = 1/10? 

10 For N = 1, 2, 3, ... , define 


Prove that the ideal generated by {g N } in the ring of entire functions is not a principal ideal. 

11 Under what conditions on a sequence of real numbers y n does there exist a bounded holomorphic 
function in the open right half plane which is not identically zero but which has a zero at each point 
1 + iy„? In particular, can this happen if (a) y n = log n, ( b ) y n = >/n,(c) y n = n,(d) y n = n 2 ? 

12 Suppose 0 < | a n | < 1, E(1 — | a„ | ) < oo, and B is the Blaschke product with zeros at the points a„ . 
Let E be the set of all points l/a„ and let be the complement of the closure of E. Prove that the 
product actually converges uniformly on every compact subset of Q, so that B e H(Q), and that B has 
a pole at each point of E. (This is of particular interest in those cases in which Q is connected.) 

13 Put a„ = 1 — n -2 , for n = 1, 2, 3, ..., let B be the Blaschke product with zeros at these points a, 
and prove that lim r _ t B(r) = 0. (It is understood that 0 < r < 1.) 

More precisely, show that the estimate 

N- 1 r — n N ~ 1 n — n 

|B(r)l< n fv;< n ^<2 e -"' 3 

! 1 - a„ r ! 1 - a„ 


is valid if a N _ x < r < a N . 

14 Prove that there is a sequence {a„} with 0 < a„ < 1, which tends to 1 so rapidly that the Blaschke 
product with zeros at the points a„ satisfies the condition 

lim sup | B(r)| = 1. 

r~* 1 


Hence this B has no radial limit at z = 1. 

15 Let q> be a linear fractional transformation which maps U onto U. For any z e U define the 

<p-orbit of z to be the set {<^ p„(z )}, where q> 0 {z) = z, <p n {z) = (p(<P n -i( z ))> n = 1, 2, 3, Ignore the case 

q>(z) = z. 

(< a ) For which <p is it true that the <p-orbits satisfy the Blaschke condition E(1 — | q> n {z ) |) < oo? 
[The answer depends in part on the location of the fixed points of (p. There may be one fixed point in 
U, or one fixed point on T, or two fixed points on T. In the last two cases it is advantageous to 
transfer the problem to (say) the upper half plane, and to consider transformations on it which either 
leave only oo fixed or leave 0 and oo fixed.] 

(b) For which (p do there exist nonconstant functions / e H 00 which are invariant under (p , i.e., 
which satisfy the relation f((p(z)) = f(z ) for all z e 17? 

16 Suppose | otj | < | a 2 1 < |a 3 1 < • • • < 1, and let n(r) be the number of terms in the sequence {a y } 
such that | ^ | < r. Prove that 


n(r)dr= £(1 -|<*jl)- 
Jo j= 1 

17 If B(z) = Ec fc z* is a Blaschke product with at least one zero off the origin, is it possible to have 
c fc > 0 for /c = 0, 1, 2,...? 
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18 Suppose B is a Blaschke product all of whose zeros lie on the segment (0, 1) and 

M = (z - 1 ) 2 B{z). 

Prove that the derivative of /is bounded in U. 

19 Put /(z) = exp [—(1 + z)/(l — z)]. Using the notation of Theorem 15.19, show that 

lim fi r (f) < 

r~* 1 

although /e 77 00 . Note the contrast with Theorem 15.24. 

20 Suppose Aj > k 2 > •••, and A„— >0 in the Miintz-Szasz theorem. What is the conclusion of the 
theorem, under these conditions? 

21 Prove an analogue of the Miintz-Szasz theorem, with L 2 (7) in place of C(7). 

22 Put f n {t) = t n e~ l (0 < t < oo, n = 0, 1, 2, . . .) and prove that the set of all finite linear combinations 
of the functions /„ is dense in L 2 (0, oo). Hint : If g e L 2 (0, oo) is orthogonal to each /„ and if 

F(z) = j* e- ,z tft)dt (Re z > 0), 

then all derivatives of F are 0 at z = 1. Consider F(1 + iy). 

23 Suppose Q r> £7,/e 77(fi), | f{e i9 )\ > 3 for all real 0,/( 0) = 0, and A t , A 2 , ..., X N are the zeros of 
1 — /in U, counted according to their multiplicities. Prove that 

I ^lA 2 ’ ’ ' I < 2- 

Suggestion : Look at B/( 1 — /), where B is a certain Blaschke product. 



CHAPTER 

SIXTEEN 

ANALYTIC CONTINUATION 


In this chapter we shall be concerned with questions which arise because func- 
tions which are defined and holomorphic in some region can frequently be 
extended to holomorphic functions in some larger region. Theorem 10.18 shows 
that these extensions are uniquely determined by the given functions. The exten- 
sion process is called analytic continuation. It leads in a very natural way to the 
consideration of functions which are defined on Riemann surfaces rather than in 
plane regions. This device makes it possible to replace “ multiple-valued 
functions” (such as the square-root function or the logarithm) by functions. A 
systematic treatment of Riemann surfaces would take us too far afield, however, 
and we shall restrict the discussion to plane regions. 

Regular Points and Singular Points 

16.1 Definition Let D be an open circular disc, suppose / e H(D), and let /? be 
a boundary point of D. We call /? a regular point of /if there exists a disc D x 
with center at /? and a function g e H(D X ) such that g(z) = /(z) for all z e D n 
D v Any boundary point of D which is not a regular point of / is called a 
singular point of f 

It is clear from the definition that the set of all regular points of / is an open 
(possibly empty) subset of the boundary of D. 

In the following theorems we shall take the unit disc U for D, without any 
loss of generality. 
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16.2 Theorem Suppose f e H(U), and the power series 

f(z)=ta n z » ( zeU ) (1) 

n = 0 

has radius of convergence 1. Then f has at least one singular point on the unit 
circle T. 


Proof Suppose, on the contrary, that every point of T is a regular point of f 
The compactness of T implies then that there are open discs D u ..., D n and 
functions gj e H(Dj) such that the center of each Dj is on T, such that T a 
Di u • • • u D n , and such that gfz) = / (z) in Dj , n U, for j = 1, . . . , n. 

If D { n Dj # 0 and V tj = D t n Dj n U , then ky # 0 (since the centers 
of the Dj are on T), and g t =f = gj in V tj . Since D { n Dj is connected, it 
follows from Theorem 10.18 that g ( = gj in D ( n Dj. Hence we may define a 
function fcinfi = (/uD 1 u ,,, u £)„by 


r/(z) (z e t/), 

U,<2) (Z 6 D i) 


Since Qid U and Q is open, there exists an e > 0 such that the disc 
D(0; 1 + e) c= Q. But h e H(Q\ h(z) is given by (1) in (7, and now Theorem 
10.16 implies that the radius of convergence of (1) is at least 1 + e, contrary 
to our assumption. //// 


16.3 Definition If / e H(U) and if every point of T is a singular point of f 
then T is said to be the natural boundary of / In this case, / has no holo- 
morphic extension to any region which properly contains U. 


16.4 Remark It is very easy to see that there exist / e H(U) for which T is a 
natural boundary. In fact, if Q is any region, it is easy to find an / e H(Q) 
which has no holomorphic extension to any larger region. To see this, let A 
be any countable set in Q which has no limit point in Q but such that every 
boundary point of Q is a limit point of A. Apply Theorem 15.11 to get a 
function / e H(Q) which is 0 at every point of A but is not identically 0. If 
g e //(QJ, where is a region which properly contains Q, and if g =/in Q, 
the zeros of g would have a limit point in Q u and we have a contradiction. 

A simple explicit example is furnished by 

/(z) = f ]z 2n = z + z 2 + z 4 + z 8 + ■■■ (z e U). (1) 

>1 = 0 

This / satisfies the functional equation 

f(z 2 ) =f(z) - z, (2) 

from which it follows (we leave the details to the reader) that / is unbounded 
on every radius of U which ends at exp {2nik/2 n }, where k and n are positive 
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integers. These points form a dense subset of T; and since the set of all 
singular points of /is closed, /has T as its natural boundary. 

That this example is a power series with large gaps (i.e., with many zero 
coefficients) is no accident. The example is merely a special case of Theorem 
16.6, due to Hadamard, which we shall derive from the following theore^n of 
Ostrowski: 


16.5 Theorem Suppose A, p k , and q k are positive integers , 

Pi <Pi <P3 < 


and 


Xq k >(X + l)p k (*-1,2,3,...). (1) 

Suppose 

/(z)= (2) 

n = 0 

has radius of convergence 1, and a n = 0 whenever p k < n < q k for some k. If 
s p (z) is the pth partial sum of (2), and if f is a regular point of f on T, then the 
sequence {s pk (z)} converges in some neighborhood of /?. 


Note that the full sequence {s p (z)} cannot converge at any point outside U. 
The gap condition (1) ensures the existence of a subsequence which converges in 
a neighborhood of /?, hence at some points outside U. This phenomenon is called 
overconvergence. 

Proof If g(z) = /(/?z), then g also satisfies the gap condition. Hence we may 
assume, without loss of generality, that /? = 1. Then / has a holomorphic 
extension to a region Q which contains U u {1}. Put 

<p{w) = 2 (w A + w A + x ) (3) 

and define F(w) = f(cp( w)) for all w such that <p(w) e Q. If | w | < 1 but w ^ 1, 
then | <p(w) | < 1, since 1 1 + w | < 2. Also, <p( 1) = 1. It follows that there exists 
an € > 0 such that <p(D( 0; 1 + e)) c Q. Note that the region <p(Z>( 0; 1 + €)) 
contains the point 1. The series 


F(w)= Z b m w m 

m — 0 


(4) 


converges if | w | < 1 + €. 
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The highest and lowest powers of w in [<p(w)] n have exponents (X + 1 )n 
and Xn. Hence the highest exponent in [<p(w)] Pk is less than the lowest expo- 
nent in |>(w)]*\ by (1). Since 


F(w) = X al<p(w)Y ( I w 1 < 1), (5) 

n = 0 

the gap condition satisfied by { a n } now implies that 

Pk (*+l )Pk 

!«,[#)]”= Z b m w m (* = 1,2,3,...). (6) 

n-0 m=0 

The right side of (6) converges, as fc— ► oo, whenever | w | < 1 + e. Hence 
(s pk (z)} converges for all z e cp(D( 0; 1 + e)). This is the desired conclusion. //// 

Note : Actually, (s pk (z)} converges uniformly in some neighborhood of /?. We 
leave it to the reader to verify this by a more careful examination of the preceding 
proof. 

16.6 Theorem Suppose X is a positive integer , {p k } is a sequence of positive 
integers such that 

Pk+i >^1 + jjpk (k= 1,2,3,...), (1) 


and the power series 


f(z)= ickZ" (2) 

fc = 1 

has radius of convergence 1. Then fhas T as its natural boundary. 

Proof The subsequence {s pk } of Theorem 16.5 is now the same (except for 
repetitions) as the full sequence of partial sums of (2). The latter cannot con- 
verge at any point outside U ; hence Theorem 16.5 implies that no point of T 
can be a regular point of /. //// 

16.7 Example Put a n = 1 if n is a power of 2, put a n = 0 otherwise, put rj n = 
exp ( — ^/n), and define 


m = 


OO 


Z a n In z"- 

n = 0 


(l) 


Since 


lim sup \a n rin\ lln = 1, 

n-> oo 


( 2 ) 
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the radius of convergence of (1) is 1. By Hadamard’s theorem, /has T as its 
natural boundary. Nevertheless, the power series of each derivative of f 

r k \z) = £ n(n - 1) • • • (n - k + 1 )a n n , z* - * (3) 

n — k 

converges uniformly on the closed unit disc. Each f (k) is therefore uniformly 
continuous on U, and the restriction of / to T is infinitely differentiable, as a 
function of 6 , in spite of the fact that T is the natural boundary of /. 

The example demonstrates rather strikingly that the presence of singularities, 
in the sense of Definition 16.1, does not imply the presence of discontinuities or 
(stated less precisely) of any lack of smoothness. 

This seems to be the natural place to insert a theorem in which continuity 
does preclude the existence of singularities : 

16.8 Theorem Suppose Q is a region , L is a straight line or a circular arc , 
Q — L is the union of two regions and Q 2 > / is continuous in Q, and f is 
holomorphic in and in Zl 2 • Then f is holomorphic in Q. 

Proof The use of linear fractional transformations shows that the general 
case follows if we prove the theorem for straight lines L. By Morera’s 
theorem, it is enough to show that the integral of / over the boundary dA is 0 
for every triangle A in Q. The Cauchy theorem implies that the integral of / 
vanishes over every closed path y in A n Q 1 or in A n Q 2 . The continuity of 
/ shows that this is still true if part of y is in L, and the integral over dA is the 
sum of at most two terms of this sort. //// 

Continuation along Curves 

16.9 Definitions A function element is an ordered pair (f D), where D is an 
open circular disc and / e H(D). Two function elements (/ 0 , D 0 ) and (f l9 D x ) 
are direct continuations of each other if two conditions hold: D 0 n Dj ^ 0, 
and / 0 (z) = f^z) for all z e D 0 n D 1 . In this case we write 

(/o,0o)~(/i,0i). (1) 

A chain is a finite sequence # of discs, say # = { D 0 , D l9 ..., D n }, such that 
_ x n D t 7^ 0 for i = 1, . . . , n. If (/ 0 , D 0 ) is given and if there exist elements 
(f , D t ) such that (/ £ _ l5 D t _ x ) ~ (f , D f ) for i = 1, . . . , n, then ( f n , D n ) is said to 
be the analytic continuation of (f 0 , D 0 ) along Note that f n is uniquely deter- 

mined by / 0 and # (if it exists at all). To see this, suppose (1) holds, and 
suppose (1) also holds with g± in place of / x . Then g± =f 0 = / x in D 0 n D x ; 
and since D x is connected, we have g x = / x in D t . The uniqueness of /„ now 
follows by induction on the number of terms in <€. 
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If ( f n , D n ) is the continuation of (/ 0 , D 0 ) along and if D n n D 0 ^ 0, it 
need not be true that (/ 0 , D 0 ) ~ (/„, D n ); in other words, the relation ~ is not 
transitive. The simplest example of this is furnished by the square-root func- 
tion: Let D 0 , D u and D 2 be discs of radius 1, with centers 1, co , and m 2 , 
where co 3 = 1, choose f i e H(Dj) so that fj(z) — z and so that (/ 0 , D 0 ) ~ 
(fu ^i), (fu ^>i) ~ (fi , ^ 2 )- In D 0 n D 2 we have/ 2 = -f 0 *f 0 . 

A chain # = {D 0 , . . . , D„} is said to cover a curve y with parameter inter- 
val [0, 1] if there are numbers 0 = s 0 < s x < • • • < s n = 1 such that y(0) is the 

center of D 0 , y(l) is the center of D n , and 

y&Si , s i+ 1 ]) c D t (i = 0, . . . , n - 1). (2) 

If (/ 0 , D 0 ) can be continued along this # to (/„ , D„), we call (/„ , D n ) an 

analytic continuation of ( f 0 , D 0 ) along y (uniqueness will be proved in 
Theorem 16.11); (/ 0 , D 0 ) is then said to admit an analytic continuation along 

y- 

Although the relation (1) is not transitive, a restricted form of transitivity 
does hold. It supplies the key to the proof of Theorem 16.11. 

16.10 Proposition Suppose that D 0 n D x n D 2 ^ 0, (D 0 ,/ 0 ) ~ and 

(Du h) ~ (D 2 , / 2 ). Then (D 0 , / 0 ) - (D 2 , / 2 ). 

Proof By assumption, / 0 = f v in D 0 n and / x = / 2 in D x n D 2 . Hence 
/o = fi m the nonempty open set D 0 n D l n D 2 . Since / 0 and f 2 are holo- 
morphic in D 0 n D 2 and D 0 n D 2 is connected, it follows that / 0 = f 2 in 

D 0 n D 2 . //// 

16.11 Theorem If (f D) is a function element and if y is a curve which starts at 
the center of D, then (f D) admits at most one analytic continuation along y. 

Here is a more explicit statement of what the theorem asserts : If y is covered 
by chains = {A 0 , A lt . . . , A m } and = {B 0 , B u . . . B„}, where A 0 = B 0 = D, 
if (f D) can be analytically continued along to a function element (g m , A m ), 
and if (f D) can be analytically continued along # 2 to ( h n , B n \ then g m = h n in 
A m B„. 

Since A m and B n are, by assumption, discs with the same center y(l), it follows 
that g m and h n have the same expansion in powers of z — y(l), and we may as well 
replace A m and B n by whichever is the larger one of the two. With this agreement, 
the conclusion is that g m — h n . 

Proof Let and ( € 2 be as above. There are numbers 
0 = s 0 <Sj < ••• <s m = 1 =s m + 1 
and 0 = <r 0 < a 1 < • < o n = 1 = o n + x such that 

?([S(, s. + i]) c A„ yilaj, <r y+1 ]) c Bj (0 < i < m, 0 <j < n). (1) 
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There are function elements (g i9 A ( ) ~ (g i+l9 A i+i ) and ( h j9 Bj) ~ (h j+l9 B j+1 ), 
for 0 < i < m — 1 and 0 <j < n — 1. Here g 0 = h 0 = /. 

We claim that if 0 < i < m and 0 < j < n 9 and if [s f , s l + 1 ] intersects 
[>;, ff/+i]» th en (&, - ( h j9 Bj). 

Assume there are pairs (i 9 y) for which this is wrong. Among them there is 
one for which i + j is minimal. It is clear that then i + j > 0. Suppose s t > a } . 
Then i > 1, and since [s f , s i+1 ] intersects [ a j9 we see that 

y(Si) e n A t n Bj. (2) 

The minimality of i + j shows that _ i) ~ (h j9 Bj); and since 

(gfi-x, >1 £ _ i) — (g i9 A t ), Proposition 16.10 implies that — ( h j9 Bj). This 

contradicts our assumption. The possibility s t < cr 7 is ruled out in the same 
way. 

So our claim is established. In particular, it holds for the pair (m, n), and 
this is what we had to prove. //// 

16.12 Definition Suppose a and /? are points in a topological space X and cp 
is a continuous mapping of the unit square I 2 = I x / (where / = [0, 1]) into 
X such that <p(0, t) = a and <p(l, t) = /? for all t e I. The curves defined by 

y,(s) = <p(s, 0 (5 e /, t e I) (1) 

are then said to form a one-parameter family {y f } of curves from cl to in X. 

We now come to a very important property of analytic continuation: 

16.13 Theorem Suppose {yj (0 < t < 1) is a one-parameter family of curves 
from cl to P in the plane , D is a disc with center at a, and the function element 
(f D) admits analytic continuation along each y t9 to an element (g t9 D t ). Then 
9i ~ 9o • 

The last equality is to be interpreted as in Theorem 16.1 1 : 

(#i> ^i) ~ (0o > Do), 

and D 0 and D x are discs with the same center, namely, /?. 

Proof Fix tel. There is a chain # = {A 09 ..., A n } which covers y t9 with 
A 0 = D, such that (g t9 D t ) is obtained by continuation of (/, D) along ( €. 
There are numbers 0 = s 0 < • < s„ = 1 such that 

= 7r(l>i , Si + 1 ]) c (i = 0, 1, . . . , n - 1). (1) 

There exists an e > 0 which is less than the distance from any of the compact 
sets E t to the complement of the corresponding open disc A t . The uniform 
continuity of cp on / 2 (see Definition 16.12) shows that there exists a S > 0 
such that 


\y t (s)-y»(s)\<e 


if s e /, u e /, | u — t \ < S. 


(2) 
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Suppose u satisfies these conditions. Then (2) shows that # covers y u , and 
therefore Theorem 16.11 shows that both g t and g u are obtained by continua- 
tion of (f D) along this same chain Hence g t = g u . 

Thus each tel is covered by a segment J t such that g u = g t for all u e 
I n J t . Since / is compact, / is covered by finitely many J t ; and since / is 
connected, we see in a finite number of steps that g Y = g 0 . //// 

Our next item is an intuitively obvious topological fact. 

16.14 Theorem Suppose T 0 and r x are curves in a topological space X , with 
common initial point oc and common end point p. IfX is simply connected , then 
there exists a one-parameter family {yj (0 < t < l) of curves from oc to P in X , 
such that y 0 = r 0 and y x = 


Proof Let [0, 7 r] be the parameter interval of T 0 and r x . Then 

rn = { r °( S ) (0<S<7l) 

jr !( 27 i — s) (it < s < 2n) 


(1) 


defines a closed curve in X. Since X is simply connected, T is null-homotopic 
in X. Hence there is a continuous H : [0, 2 it] x [0, 1] — » X such that 

H(s, 0) = T(s), H(s, 1) = c e X, H( 0, t) = H(2n, t). (2) 

If d>: 0— * X is defined by 

<&(re m ) = H(6, 1 - r) (0 < r < 1, 0 <S 6 < 2n), 

(2) implies that <I> is continuous. Put 

y,(6) = <D[(1 - t)e ie + te" i# ] (0 < 0 < it, 0 < t < 1). 

Since 0(e i# ) = H(9, 0) = T(0), it follows that 

y,(0) = m = HO) = a (0 < t < 1), 

7,(71) = <sx-i) = r(n) = p (o < t < i), 

y 0 (6) = QKe") = T(0) = r 0 (6) (0 < 6 < n) 

and 

yi (0) = <&(e~ ie ) = <&(e ii2n ~ 0) ) = r(27i - 6) = T^O) (0 < 6 < n). 

This completes the proof. //// 


The Monodromy Theorem 

The preceding considerations have essentially proved the following important 
theorem. 
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16.15 Theorem Suppose Q is a simply connected region , (f D) is a function 
element , DcQ, and (f D) can be analytically continued along every curve in Q, 
that starts at the center of D. Then there exists g e H(Q) such that g(z) = /(z) 
for all z e D. 


Proof Let T 0 and r x be two curves in Q from the center a of D to some 
point p e Q. It follows from Theorems 16.13 and 16.14 that the analytic con- 
tinuations of (f D) along r 0 and r x lead to the same element (g p , Dp), where 
Dp is a disc with center at ft. If Dp t intersects Dp, then (g Pl , Dp t ) can be 
obtained by first continuing (f D) to /?, then along the straight line from ft to 
P x . This shows that g Pl = g fi in D Pl n Dp. 

The definition 


g(z) = gp(z) (z e £>„) 

is therefore consistent and gives the desired holomorphic extension of /. //// 


16.16 Remark Let Q be a plane region, fix w Q, let D be a disc in Q. Since 
D is simply connected, there exists / e H(D) such that exp [/(z)] = z — w. 
Note that f'(z) = (z — w)" 1 in D, and that the latter function is holomorphic 
in all of Q. This implies that (f D) can be analytically continued along every 
path y in Q that starts at the center a of D : If y goes from a to ft, if D p = 
D{p ; r) c Q, if 


r z = y 4- [ft z] (z e Dp) 


( 1 ) 


and if 


gfa) = J r (C - w) - ■ 1 dt: +/(«) (z e D ff ), (2) 

then (| gp , D p ) is the continuation of (/, D) along y. 

Note that g'p(z) = (z — w)“ 1 in Dp . 

Assume now that there exists # e //(Q) such that g(z) = /(z) in D. Then 
g'(z) = (z — w)~ 1 for all z e Q. If T is a closed path in Q, it follows that 

Ind r (w) = f I g’(z) dz = 0. (3) 

2m J r 

We conclude (with the aid of Theorem 13.11) that the monodr omy theorem 
fails in every plane region that is not simply connected. 
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Construction of a Modular Function 


16.17 The Modular Group This is the set G of all linear fractional transform- 
ations (p of the form 


<*>00 = 


az + b 
cz + d 


(i) 


where a, b , c , and d are integers and ad — be = 1. 

Since a, b , c, and d are real, each <peG maps the real axis onto itself (except 
for oo ). The imaginary part of q>(i) is ( c 2 + d 2 )~ 1 >0. Hence 

cp(U + ) = U + ((p e G), (2) 

where II + is the open upper half plane. If (p is given by (1), then 


<p-» 


dw — b 
— cw + a 


( 3 ) 


so that cp ~ 1 e G. Also <p°^eGif<peG and if/ e G. 

Thus G is a group, with composition as group operation. In view of (2) it is 
customary to regard G as a group of transformations on II + . 

The transformations z—>z + 1 (a = b = d = 1, c = 0) and z— > — 1/z 
(a = d = 0, b = — 1, c — 1) belong to G. In fact, they generate G (i.e., there is no 
proper subgroup of G which contains these two transformations). This can be 
proved by the same method which will be used in Theorem 16.19(c). 

A modular function is a holomorphic (or meromorphic) function / on II + 
which is invariant under G or at least under some nontrivial subgroup T of G. 
This means that / ° (p = /for every (p e T. 


16.18 A Subgroup We shall take for T the group generated by c and t, where 

tf(z) = 2z\v = z + 1 (!) 

One of our objectives is the construction of a certain function X which is invari- 
ant under T and which leads to a quick proof of the Picard theorem. Actually, it 
is the mapping properties of X which are important in this proof, not its invari- 
ance, and a quicker construction (using just the Riemann mapping theorem and 
the reflection principle) can be given. But it is instructive to study the action of T 
on II + , in geometric terms, and we shall proceed along this route. 

Let Q be the set of all z which satisfy the following four conditions, where 
z = x + iy: 

y> 0, -1<x< 1, 1 2z + 1 1 > 1, | 2z — 1 1 > 1. (2) 

Q is bounded by the vertical lines x = — 1 and x = 1 and is bounded below 
by two semicircles of radius j, with centers at — ^ and at Q contains those of 
its boundary points which lie in the left half of II + . Q contains no point of the 
real axis. 
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We claim that Q is a fundamental domain of Y. This means that statements (a) 
and (b) of the following theorem are true. 


16.19 Theorem Let Y and Q be as above . 


(a) If (p i and and cp± # <p 2 , then cp^Q) n <p 2 (0 = 0. 

(b) lUr<P(0 = n + . 

(c) T contains all transformations cp g G of the form 


(p(z) = 


az + b 
cz + d 


for which a and d are odd integers , b and c are even. 


(1) 


Proof Let r x be the set of all cp e G described in (c). It is easily verified that 
r\ is a subgroup of G. Since g g Y t and t g T^ it follows that Y a Y x . To 
show that T = T 1} i.e., to prove (c), it is enough to prove that (a!) and (b) 
hold, where (a') is the statement obtained from (a) by replacing Y by T^ For 
if (a!) and ( b ) hold, it is clear that Y cannot be a proper subset of Y v 
We shall need the relation 


Im (p(z) — 


Im z 
\cz + d\ 2 


(2) 


which is valid for every cp g G given by (1). The proof of (2) is a matter of 
straightforward computation, and depends on the relation ad — be = 1. 

We now prove (a'). Suppose cp x and cp 2 e Y u <p x # <p 2 , and define cp = 
cpi 1 °cp 2 . If z g cpi(Q) n cp 2 {Q\ then cp^iz) g Q n cp(Q). It is therefore 
enough to show that 


Q n <p(Q) = 0 (3) 

if cp g r x and cp is not the identity transformation. 

The proof of (3) splits into three cases. 

If c = 0 in (1), then ad = 1, and since a and d are integers, we have 
a = d = ±1. Hence cp(z) = z + 2n for some integer n ^ 0, and the description 
of Q makes it evident that (3) holds. 

If c = 2d , then c = ±2 and d = ± 1 (since ad — be = 1). Therefore 
cp(z) = a(z) + 2m, where m is an integer. Since g(Q ) a |), (3) holds. 

If c # 0 and c # 2d , we claim that | cz + d | > 1 for all z e Q. Otherwise, 
the disc D( — d/c ; 1/ 1 c | ) would intersect Q. The description of Q shows that if 
a # — j is a real number and if S(a; r) intersects Q , then at least one of the 
points — 1, 0, 1 lies in D( a; r). Hence | cw + d \ < 1, for w = — 1 or 0 or 1. But 
for these w, cw + d is an odd integer whose absolute value cannot be less 
than 1. So | cz + d\ > 1, and it now follows from (2) that Im cp(z) < Im z for 



330 REAL AND COMPLEX ANALYSIS 


every z e Q. If it were true for some z e Q that (p(z) e Q , the same argument 
would apply to cp ~ 1 and would show that 

Im z = Im <p _1 (<p(z )) < Im cp(z). (4) 

This contradiction shows that (3) holds. 

Hence ( a ') is proved. 

To prove (b), let £ be the union of the sets (p(Q), for cp e T. It is clear that 
£ <= n + . Also, £ contains the sets t n (Q), for n — 0, ±1, ±2,..., where 
t n (z) = z + 2n. Since o maps the circle 1 2z + 1 1 = 1 onto the circle 
1 2z — 1| = 1, we see that £ contains every z e II + which satisfies all inequal- 
ities 


|2z-(2m+ 1)| > 1 (m = 0, ±1, ±2,...). (5) 

Fix w e n + . Since Im w > 0, there are only finitely many pairs of inte- 
gers c and d such that | cw + d | lies below any given bound, and we can 
choose (p 0 e T so that | cw + d \ is minimized. By (2), this means that 

Im cp(w) < Im (p 0 (w) (cp e T). (6) 

Put z = <p 0 (w ). Then (6) becomes 

Im <p(z ) < Im z (cp £ T). (7) 

Apply (7) to cp — <jx~ n and to cp = o~ l x~ n . Since 


(<7T-")(Z) 


z — In 
2z — An + 1 ’ 


((7 _1 T _n )(z) 


z — 2n 

— 2z 4- An -h 1 ’ 


( 8 ) 


it follows from (2) and (7) that 

|2z-4n+ 1|> 1, | 2z — 4n — 1 1 > 1 (n = 0, ±1, ±2,...). (9) 

Thus z satisfies (5), hence ze£; and since w = cp^ 1 (z) and <p 0 " 1 e T, we have 
w e £. 

This completes the proof. //// 


The following theorem summarizes some of the properties of the modular 
function X which was mentioned in Sec. 16.18 and which will be used in Theorem 
16.22. 


16.20 Theorem IfY and Q are as described in Sec . 16.18, there exists a func- 
tion X e H(Tl + ) such that 

(a) X o (p = X for every cp e T. 

(b) X is one-to-one on Q. 

(c) The range Q of X [ which is the same as X(Q '), by (a)], is the region consist- 
ing of all complex numbers different from 0 and 1. 

(d) X has the real axis as its natural boundary. 
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Proof Let Q 0 be the right half of Q. More precisely, Q 0 consists of all 
z e II + such that 


0 < Re z < 1, 1 2z — 1| > 1. (1) 

By Theorem 14.19 (and Remarks 14.20) there is a continuous function h on 
Q 0 which is one-to-one on Q 0 and holomorphic in Q 0 , such that h(Q 0 ) = II + , 
h( 0) = 0, h( 1) = 1, and h(oo) = oo. The reflection principle (Theorem 11.14) 
shows that the formula 

h(-x + iy) = h(x + iy) (2) 

extends h to a continuous function on the closure Q of Q which is a confor- 
mal mapping of the interior of Q onto the complex plane minus the non- 
negative real axis. We also see that h is one-to-one on Q , that h(Q) is the 
region Q described in (c), that 

h (- 1 + iy) = h( 1 + iy) = h(x {- 1 + iy)) (0 < y < oo), (3) 

and that 

h(- i + je w ) = h( i + ±e i(n - 0) ) = h(<r(- i + ie w )) (0 < 6 < n). (4) 

Since h is real on the boundary of Q, (3) and (4) follow from (2) and the 
definitions of a and t. 

We now define the function 2: 

a(z) = %>-‘(z)) (z e <?(©, e n. (5) 

By Theorem 16.19, each z e IT + lies in cp(Q) for one and only on£ cp e T. 
Thus (5) defines 2(z) for z e II + , and we see immediately that X has properties 
(a) to (c) and that X is holomorphic in the interior of each of the sets cp(Q ). 

It follows from (3) and (4) that X is continuous on 

Q u *~\Q) u er-^G), 

hence on an open set K which contains Q. Theorem 16.8 now shows that X is 
holomorphic in V. Since II + is covered by the union of the sets <p(V), cp e T, 
and since X o (p = X, we conclude that X e H{ II + ). 

Finally, the set of all numbers cp( 0) = b/d is dense on the real axis. If X 
could be analytically continued to a region which properly contains IT + , the 
zeros of X would have a limit point in this region, which is impossible since X 
is not constant. //// 

The Picard Theorem 

16.21 The so-called “little Picard theorem” asserts that every nonconstant entire 
function attains each value, with one possible exception. This is the theorem 
which is proved below. There is a stronger version: Every entire function which is 
not a polynomial attains each value infinitely many times , again with one possible 
exception. That one exception can occur is shown by /(z) = e z 9 which omits the 
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value 0. The latter theorem is actually true in a local situation: Iff has an isolated 
singularity at a point z 0 and if f omits two values in some neighborhood of z 0 , then 
z 0 is a removable singularity or a pole of f This so-called “ big Picard theorem ” is 
a remarkable strengthening of the theorem of Weierstrass (Theorem 10.21) which 
merely asserts that the image of every neighborhood of z 0 is dense in the plane if / 
has an essential singularity at z 0 . We shall not prove it here. 

16.22 Theorem If f is an entire function and if there are two distinct complex 
numbers a and ft which are not in the range of f then f is constant . 


Proof Without loss of generality we assume that a = 0 and P = 1 ; if not, 
replace / by (/ — a)/(/? — a). Then / maps the plane into the region Q 
described in Theorem 16.20. 

With each disc D x c Q there is associated a region V x c: II + (in fact, 
there are infinitely many such V l9 one for each <p e T) such that X is one-to- 
one on V 1 and A(P 1 ) = D 1 ; each such V x intersects at most two of the 
domains cp(Q ). Corresponding to each choice of V x there is a function e 
H(D X ) such that ip x (X(z)) = z for all z e V v 

If D 2 is another disc in Q and if D x n D 2 ^ 0, we can choose a corre- 
sponding V 2 so that V t n V 2 j=- 0. The function elements (^ 1} D x ) and 
(ip 2 , D 2 ) will then be direct analytic continuations of each other. Note that 

HD,) <= n + . 

Since the range of / is in Q, there is a disc A 0 with center at 0 so that 
f(A 0 ) lies in a disc D 0 in Q. Choose i^ 0 G H(D 0 ), as above, put g{z) = i Ao(/( z )) 
for z e A 0 , and let y be any curve in the plane which starts at 0. The range of 
/ o y is a compact subset of Q. Hence y can be covered by a chain of discs, 
A 0i ..., A ni so that each f(A lies in a disc D t in Q, and we can choose 
\l/ t e H(Di) so that (^ f , D t ) is a direct analytic continuation of(^ I _ 1 , x ), for 

i = 1, ...n. This gives an analytic continuation of the function element 
(g 9 A 0 ) along the chain { A 0 , ..., A n }; note that o / has positive imaginary 
part. 

Since (g, A 0 ) can be analytically continued along every curve in the plane 
and since the plane is simply connected, the monodromy theorem implies 
that g extends to an entire function. Also, the range of g is in II + , hence 
(9 ~ Mg + 0 is bounded, hence constant, by Liouville’s theorem. This shows 
that g is constant, and since \j/ 0 was one-to-one on f(A 0 ) and A 0 was a non- 
empty open set, we conclude that /is constant. //// 

Exercises 

1 Suppose /(z) = 2a n z", a n > 0, and the radius of convergence of the series is 1. Prove that / has a 
singularity at z = 1. Hint : Expand / in powers of z — j. If 1 were a regular point of /, the new series 
would converge at some x > 1. What would this imply about the original series? 


2 Suppose (/, D ) and ( g , D) are function elements, P is a polynomial in two variables, and P(/, g) = 0 
in D. Suppose / and g can be analytically continued along a curve y, to f x and g x . Prove that 
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P (/ l5 0 i) = 0. Extend this to more than two functions. Is there such a theorem for some class of 
functions P which is larger than the polynomials? 

3 Suppose ft is a simply connected region, and u is a real harmonic function in ft. Prove that there 
exists an / e H(Q) such that u = Re /. Show that this fails in every region which is not simply con- 
nected. 

4 Suppose X is the closed unit square in the plane, /is a continuous complex function on X , and / has 
no zero in X. Prove that there is a continuous function g on X such that / = e 9 . For what class of 
spaces X (other than the above square) is this also true? 

5 Prove that the transformations z — >z + 1 and z— ► — 1/z generate the full modular group G. Let R 
consist of all z = x 4 - iy such that |x| < y > 0 , and |z| > 1 , plus those limit points which have 
x < 0. Prove that R is a fundamental domain of G. 

6 Prove that G is also generated by the transformations q> and ij/, where 

1 z — 1 

<P(z) = , i//(z) = . 

z z 

Show that (p has period 2, \J/ has period 3. 

7 Find the relation between composition of linear fractional transformations and matrix multiplica- 
tion. Try to use this to construct an algebraic proof of Theorem 16.19(c) or of the first part of 
Exercise 5. 

8 Let £ be a compact set on the real axis, of positive Lebesgue measure, let ft be the complement of 
E, relative to the plane, and define 


/(*)= 


I 


dt 


t — z 


(Z e ft). 


Answer the following questions: 

(a) Is / constant? 

(b) Can /be extended to an entire function? 

(c) Does lim z/(z) exist as z— ► oo? If so, what is it? 

(i d) Does / have a holomorphic square root in ft? 

( e ) Is the real part of /bounded in ft? 

(/) Is the imaginary part of /bounded in ft? 

[If “ yes ” in ( e ) or (/), give a bound.] 

(i g ) What is J y /(z) dz if y is a positively oriented circle which has E in its interior? 
{h) Does there exist a bounded holomorphic function q> in ft which is not constant? 

9 Check your answers in Exercise 8 against the special case 


E = [-l 1]. 


10 Call a compact set E in the plane removable if there are no nonconstant bounded holomorphic 
functions in the complement of E. 

(a) Prove that every countable compact set is removable. 

{b) If £ is a compact subset of the real axis, and m(£) = 0, prove that £ is removable. Hint : £ 
can be surrounded by curves of arbitrarily small total length. Apply Cauchy’s formula, as in Exercise 
25, Chap 10. 

(c) Suppose £ is removable, ft is a region, £ c: ft, / e H(Q — £), and / is bounded. Prove that / 
can be extended to a holomorphic function in ft. 

{d) Formulate and prove an analogue of part ( b ) for sets £ which are not necessarily on the real 

axis. 

(e) Prove that no compact connected subset of the plane (with more than one point) is remov- 
able. 
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170 = 


a + * 


11 For each positive number a, let T a be the path with parameter interval (— oo, oo) defined by 

-t — ni ( — oo < t < — a), 
nit 

— ( — a < t < a), 

a 

U + ni ( a < t < oo ). 

Let be the component of the complement of TJ which contains the origin, and define 

>(0 


m 


1 P exp ( e 

2ni Jr, w ~ ^ 


■ dw (z e (1J. 


Prove that f fi is an analytic continuation of f a if a < p. Prove that therefore there is an entire function / 
whose restriction to is f a . Prove that 

lim f(re ie ) = 0 


for every e ie # l. (Here r is positive and 6 is real, as usual.) Prove that /is not constant. [Hint: Look 
at fm If 


prove that 


g =/exp (— /), 

lim g{re ie ) = 0 


for every e ie . 

Show that there exists an entire function h such that 


12 Suppose 


lim h(nz ) = 

n-* oo 


s 


if z = 0, 
if z # o. 


oo 


/(*) = I 

k = 1 





Find the regions in which the two series converge. Show that this illustrates Theorem 16.5. Find the 
singular point of / which is nearest to the origin. 

13 Let Q = {z: \ < \ z\ < 2}. For n = 1, 2, 3, ... let X n be the set of all / e H{Q) that are nth deriv- 
atives of some g e H(Q). [In other words, X n is the range of the differential operator D n with domain 
tf(fl).] 

(a) Show that / e X { if and only if J y / (z) dz — 0, where y is the positively oriented unit circle. 

{b) Show that / e X n for every n if and only if / extends to a holomorphic function in £>( 0; 2). 

14 Suppose Q is a region, p e Q, R < o o. Let & be the class of all / e H(Q) such that \f(p)\ < R and 
/(ft) contains neither 0 nor 1. Prove that & is a normal family. 

15 Show that Theorem 16.2 leads to a very simple proof of the special case of the monodromy 
theorem (16.15) in which ft and D are concentric discs. Combine this special case with the Riemann 
mapping theorem to prove Theorem 16.15 in the generality in which it is stated. 



CHAPTER 


SEVENTEEN 

H p - SPACES 


This chapter is devoted to the study of certain subspaces of H(U) which are 
defined by certain growth conditions; in fact, they are all contained in the class N 
defined in Chap. 15. These so-called H p - spaces (named for G. H. Hardy) have a 
large number of interesting properties concerning factorizations, boundary 
values, and Cauchy-type representations in terms of measures on the boundary of 
U. We shall merely give some of the highlights, such as the theorem of F. and M. 
Riesz on measures n whose Fourier coefficients fi(n) are 0 for all n < 0, Beurling’s 
classification of the invariant subspaces of f/ 2 , and M. Riesz’s theorem on conju- 
gate functions. 

A convenient approach to the subject is via subharmonic functions, and we 
begin with a brief outline of their properties. 

Subharmonic Functions 

17.1 Definition A function u defined in an open set Q in the plane is said to 

be subharmonic if it has the following four properties. 

{a) — oo < u(z) < oo for all zed. 

(b) u is upper semicontinuous in Q. 

(c) Whenever D(a ; r) c= Q, then 

u(a) < I u(a + re ie ) dd. 

(d) None of the integrals in (c) is — oo. 


335 
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Note that the integrals in (c) always exist and are not + oo, since (a) and ( b ) 
imply that u is bounded above on every compact K c= Q. {Proof. If K n is the set 
of all z e K at which u(z) > n , then K zd => K 2 •• • , so either K n = 0 for some 
".orfl*.* 0, in which case u(z) = oo for some z e K.] Hence (d) says that the 
integrands in (c) belong to I}(T). 

Every real harmonic function is obviously subharmonic. 


17.2 Theorem If u is subharmonic in fi, and if cp is a monotonically increasing 
convex function on R 1 , then cp ° u is subharmonic. 


[To have (p ° u defined at all points of Q, we put cp( — oo) = lim cp(x ) as 
x— * — oo.] 

Proof First, cp o u is upper semicontinuous, since cp is increasing and contin- 
uous. Next, if D(a ; r) c= Q, we have 

cp(u(a -h re ld )) d6. 

The first of these inequalities holds since cp is increasing and u is sub- 
harmonic; the second follows from the convexity of cp , by Theorem 3.3. //// 


</>(«(< *)) 




w(a + re id ) d6 ) < 


17.3 Theorem IfQ is a region, f e H{Q), and f is not identically 0, then log |/| 
is subharmonic in Q, and so are log + | / 1 and I / \ p (0 < p < oo). 

Proof It is understood that log | f(z) \ = — oo if / (z) = 0. Then log | / 1 is 
upper semicontinuous in Q, and Theorem 15.19 implies that log |/| is sub- 
harmonic. 

The other assertions follow if we apply Theorem 17.2 to log |/| in place 
of u, with 


cp{t) = max (0, t) and cp(t) == e pt . //// 

17.4 Theorem Suppose u is a continuous subharmonic function in Q, K is a 
compact subset of Q, h is a continuous real function on K which is harmonic in 
the interior V of K, and u(z) < h(z) at all boundary points of K. Then 
u(z) < h(z) for all z e K. 

This theorem accounts for the term “subharmonic.” Continuity of u is not 
necessary here, but we shall not need the general case and leave it as an exercise. 

Proof Put u 1 = u — h, and assume, to get a contradiction, that iq(z) > 0 for 
some zeV. Since tq is continuous on K, tq attains its maximum m on K; 
and since tq < 0 on the boundary of K, the set E = {z e K: w x (z) = m} is a 
nonempty compact subset of V. Let z 0 be a boundary point of E. Then for 
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some r > 0 we have D(z 0 ; r) c= V, but some subarc of the boundary of 
D(z 0 ; r) lies in the complement of E. Hence 


“l(zo) = m > 


Tn r,“‘ (Z ° 


+ ref 9 ) dd 9 


and this means that is not subharmonic in V. But if u is subharmonic, so 
is u — h, by the mean value property of harmonic functions, and we have our 
contradiction. //// 


17.5 Theorem Suppose u is a continuous subharmonic function in 17, and 

m(r) = f f u(re ie ) d8 (0 < r < 1). (1) 

271 J-n 

If r i < r 2 , then m(r Y ) < m(r 2 ). 

Proof Let h be the continuous function on 5(0; r 2 ) which coincides with u 
on the boundary of 5(Q; r 2 ) and which is harmonic in D(0; r 2 ). By Theorem 
17.4, u < h in D(0; r 2 ). Hence 


r 


h(r 2 e ie ) dO = 


1111 


The Spaces H p and N 

17.6 Notation As in Secs 11.15 and 11.19, we define f r on T by 

fr(e ie ) =f(re i$ ) (0 < r < 1) (1) 

if / is any continuous function with domain U 9 and we let a denote Lebesgue 
measure on T, so normalized that a(T) = 1. Accordingly, Zf-norms will refer to 
Zf(cr). In particular, 

ll/rllp = | [l /, l p d^' P (0 < p < 00), (2) 

11/rlL = sup \f(rei 9 )\, (3) 

0 

and we also introduce 

II fr II 0 = exp j^log + \f r \ da. (4) 

17.7 Definition If/e H(U) and 0 < p < oo, we put 

ll/ll P = sup{||/ r || p :0<r<l}. 


( 1 ) 
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If 0 < p < oo, H p is defined to be the class of all f e H(U) for which 
\\f\\ p < co. (Note that this coincides with our previously introduced termi- 
nology in the case p = co.) 

The class N consists of all / e H(U) for which ||/|| 0 < °°. 

It is clear that H 00 c H p c H s c N if 0 < s < p < oo. 


17.8 Remarks (a) When p < oo, Theorems 17.3 and 17.5 show that \\f r \\ p is a 
nondecreasing function of r, for every / e H(U); when p — co, the same 
follows from the maximum modulus theorem. Hence 

ll/llp = lim ||/ r ||p. (1) 

r- 1 

(b) For 1 < p < oo, \\f\\ p satisfies the triangle inequality, so that H p is a 
normed linear space. To see this, note that the Minkowski inequality gives 

ll(/+ 9) ,11, = II fr + 9r lip < WfrWp + W\ P (2) 

if 0 < r < 1. As r— » 1, we obtain 

\\f+g\\ P £\\f\\ P +\\g\\ P - (3) 

(c) Actually, H p is a Banach space , if 1 < p < oo : To prove completeness, 

suppose {/„} is a Cauchy sequence in H p , |z| < r < R < 1, and apply the 
Cauchy formula to integrating around the circle of radius R , center 0. 

This leads to the inequalities 

(* - r)\m -f m (z)\ < II (f n fm) R II i < ll(/n -fm) R \\ P < Wfn ~ fm\\ P 

from which we conclude that { /„} converges uniformly on compact subsets of 
U to a function / e H(U). Given e > 0, there is an m such that II /„ — f m \\ p < € 
for all n > m, and then, for every r < 1, 

II(/-/JJp= lim ||(/ n -/JJ p < €. (4) 


This gives ||/-/J p -> 0 as oo. 

(d) For p < 1, H p is still a vector space, but the triangle inequality is no 
longer satisfied by ||/|| p . 

We saw in Theorem 15.23 that the zeros of any f e N satisfy the Blaschke 
condition £(1 — |aj) < oo. Hence the same is true in every H p . It is interesting 
that the zeros of any / e H p can be divided out without increasing the norm: 

17.9 Theorem Suppose f e N,f # 0, and B is the Blaschke product formed with 
the zeros off. Put g — f/B. Then g e N and \\g\\ 0 = ||/|| 0 . 

Moreover , iff e H p , then g e H p and \\g\\ p = ||/|| p (0 < p < oo). 



H p -spaces 339 


Proof Note first that 


\g{z)\>\m\ (zeU). (1) 

In fact, strict inequality holds for every z e U, unless / has no zeros in U , in 
which case B — 1 and g — f 

If s and t are nonnegative real numbers, the inequality 

log + (st) < log + s - h log + t (2) 

holds since the left side is 0 if st < 1 and is log s + log t if st > 1. Since 
\g\ = \f\/\B\, (2) gives 


log + I&I < log + I/I + log (3) 

By Theorem 15.24, (3) implies that ||gr|| 0 < ||/|| 0 , and since (1) holds, we 
actually have Mo = ll/llo- 

Now suppose / g H p for some p > 0. Let B n be the finite Blaschke 
product formed with the first n zeros of / (we arrange these zeros in some 
sequence, taking multiplicities into account). Put g n =f/B n . For each n, 
\B„(re ie )\~* 1 uniformly, as r-> 1. Hence \\g H \\ p = \\f\\ p . As n-> co, |gf„| 
increases to | g | , so that 

\\g r \\ P = Mm ll(0 B ) r |lp (0 < r < 1), (4) 

n~* oo 

by the monotone convergence theorem. The right side of (4) is at most \\f\\ p , 
for all r < 1. If we let r— > 1, we obtain ||gr|| p < \\f\\ p . Equality follows now 

from (1), as before. //// 

17.10 Theorem Suppose 0 < p < co, f e H p , / # 0, and B is the Blaschke 
product formed with the zeros of f Then there is a zero-free function h e H 2 
such that 

f=B- h 2lp . (1) 

In particular, every f e H 1 is a product 

f=gh (2) 


in which both factors are in H 2 . 

Proof By Theorem 17.9, f/B e H p ; in fact, \\f/B\\ p — \\f\\ p . Since f/B has no 
zero in U and U is simply connected, there exists cp e H(U) so that 
exp (<p) = f/B (Theorem 13.11). Put h = exp ip(p/2). Then h e H(U) and 
| h | 2 = | f/B \ p , hence h e H 2 , and (1) holds. 

In fact, |l*lll = ll/li;. 

To obtain (2), write (1) in the form / = ( Bh ) • h. 


Illl 
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We can now easily prove some of the most important properties of the H p - 
spaces. 

17.11 Theorem If 0 < p < oo and f e H p , then 

(a) the nontangential maximal functions N a fare in B(T\for all a < 1 ; 

(b) the nontangential limits f*(e ld ) exist a.e. on T, and f* e L P (T); 

(c) lim r _ 1 ||/* -f r || p = 0, and 

M) \\f*\\ p = \\f\\ p . 


Iff e H 1 then f is the Cauchy integral as well as the Poisson integral of f*. 


Proof We begin by proving (a) and ( b ) for the case p > 1. Since holomorphic 
functions are harmonic, Theorem 11.30 (b) shows that every / e H p is then the 
Poisson integral of a function (call it /*) in L?(T). Hence N a f e IF(T ), by 
Theorem 11.25(h), and f*(e w ) is the nontangential limit of / at almost every 
T, by Theorem 11.23. 

If 0 < p < 1 and / e H p , use the factorization 


/= Bh 2lp 


( 1 ) 


given by Theorem 17.10, where B is a Blaschke product, h e H 2 , and h has 
no zero in U. Since | / 1 < | h \ 2/p in U, it follows that 

(N a f) p < (N a h) 2 , (2) 

so that NJe L P (T ), because N a h e L 2 (T). 

Similarly, the existence of B* and h* a.e. on T implies that the non- 
tangential limits of / (call them /*) exist a.e. Obviously, |/* | < N a f wherever 
/* exists. Hence /* e I?{T). 

This proves {a) and (h), for 0 < p < oo. 

Since f r — ►/* a.e. and \f r \ < N a f the dominated convergence theorem 
gives (c). 

If p > 1, (d) follows from (c), by the triangle inequality. If p < 1, use Exer- 
cise 24, Chap. 3, to deduce {d) from (c). 

Finally, life H \ r < 1, and f r (z) = f(rz\ then f r e H(D( 0, 1/r)), and there- 
fore f r can be represented in U by the Cauchy formula 


fXz) ~2n J_„l -e-“z dt 

and by the Poisson formula 

m = ^ j" P{z, e“)f r (e lt ) dt. 


( 3 ) 


( 4 ) 



H p - spaces 341 


For each z e U 9 1 1 — e u z\ and P(z, e lt ) are bounded functions on T. The 
case p = 1 of (c) leads therefore from (3) and (4) to 


and 


/(z) 


m 


= j_ r* /vo 

2n J_„ 1 -e~ l, z 

-s[>^ 


dt 


0/V) dt. 


(5) 

( 6 ) 

llll 


The space H 2 has a particularly simple characterization in terms of power 
series coefficients: 


17.12 Theorem Suppose f e H(U) and 


/(z) = X a„ z". 
0 


TTien f e H 2 if and only if Y,o \ a n\ 2 < °°- 


Proof By ParsevaFs theorem, applied to f r with r < 1, 

X l«J 2 = lim ZlajV” = Hm f \f r \ 2 da = 

0 r->l 0 r-+ 1 Jr 


llll 


The Theorem of F. and M. Riesz 

17.13 Theorem If p is a complex Borel measure on the unit circle T and 


for n = -1, -2, -3, 
Lebesgue measure. 


I 


e dp(t) = 0 


( 1 ) 


then p is absolutely continuous with respect to 


Proof Put / = P[dp], Then / satisfies 

MrhzM (0 < r < 1). (2) 

(See Sec. 1 1.17.) Since, setting z = re w , 

P{z, e“) = £ r M e ine e- im , 


(3) 
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as in Sec. 11.5, the assumption (1), which amounts to saying that the Fourier 
coefficients fi(n) are 0 for all n < 0, leads to the power series 

/(z) = z £(”) z " (z e U). (4) 

0 

By (4) and (2), fe H 1 . Hence /= P[/*], by Theorem 17.11, where 
/* e //(T). The uniqueness of the Poisson integral representation (Theorem 
11.30) shows now that dp = /* //// 

The remarkable feature of this theorem is that it derives the absolute contin- 
uity of a measure from an apparently unrelated condition, namely, the vanishing 
of one-half of its Fourier coefficients. In recent years the theorem has been 
extended to various other situations. 


Factorization Theorems 

We already know from Theorem 17.9 that every f e H p (except / = 0) can be 
factored into a Blaschke product and a function g e H p which has no zeros in U. 
There is also a factorization of g which is of a more subtle nature. It concerns, 
roughly speaking, the rapidity with which g tends to 0 along certain radii. 


17.14 Definition An inner function is a function M e H 00 for which | M* | = 1 
a.e. on T. (As usual, M* denotes the radial limits of M) 

If cp is a positive measurable function on T such that log (p e I}(T), 
and if 


Q(z) = c exp 



e — z 


log (p(e lt ) dt 


( 1 ) 


for z e U, then Q is called an outer function. Here c is a constant, | c \ = 1. 

Theorem 15.24 shows that every Blaschke product is an inner function, 
but there are others. They can be described as follows. 


17.15 Theorem Suppose c is a constant , \c \ = 1, B is a Blaschke product , p is a 
finite positive Borel measure on T which is singular with respect to Lebesgue 
measure , and 


M(z) = cB{z) exp | - j* (z e U). (1) 

Then M is an inner function , and every inner function is of this form. 

Proof If (1) holds and g = M/B, then log | g \ is the Poisson integral of — dp, 
hence log | g \ < 0, so that g e H 00 , and the same is true of M. Also Dp = 0 
a.e., since p is singular (Theorem 7.13), and therefore the radial limits of 
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log | g | are 0 a.e. (Theorem 11.22). Since |£*| = 1 a.e., we see that M is an 
inner function. 

Conversely, let B be the Blaschke product formed with the zeros of a 
given inner function M and put g = M/B. Then log | g | is harmonic in U. 
Theorems 15.24 and 17.9 show that | g \ < 1 in U and that | g* | = 1 a.e. on T. 
Thus log| 0 |<O. We conclude from Theorem i 1.30 that log|#| is the 
Poisson integral of —d\i, for some positive measure /z on T. Since 
log | g* | = 0 a.e. on T, we have Dp = 0 a.e. on T, so p is singular. Finally, 
log | g | is the real part of 


r n e lt + z 

h(z) = - dfi(t), 

J-n e Z 

and this implies that g = c exp ( h ) for some constant c with | c \ = 1. Thus M 
is of the form ( 1 ). 

This completes the proof. //// 


The simplest example of an inner function which is not a Blaschke product is 
the following: Take c = 1 and B — 1, and let \i be the unit mass at t = 0. Then 

M(z) = exp 

which tends to 0 very rapidly along the radius which ends at z = 1. 

17.16 Theorem Suppose Q is the outer function related to cp as in Definition 
17.14. Then 

(a) log | Q | is the Poisson integral of log cp. 

(b) lim r _ x | | = cp(e lB ) a.e. on T. 

(c) Q e H p if and only if cp e LF(T). In this case , || Q || p = \\cp\\ p . 


Proof (a) is clear by inspection and (a) implies that the radial limits of 
log | Q | are equal to log cp a.e. on T, which proves ( b ). If Q e H p , Fatou’s 
lemma implies that || 6 % < ||<2H P , so \\cp\\ p < \\Q\\ p , by (b). Conversely, if 
cp e If(T\ then 


| Q{re ie ) \ p = exp j* P r (d - t) log cp^e") 

~h[ Pr(e ~ t)<pp{ei,) dt ’ 


by the inequality between the geometric and arithmetic means (Theorem 3.3), 
and if we integrate the last inequality with respect to 6 we find that ||Q|| P < 
\\cp\\ p if p < co. The case p = oo is trivial. //// 
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17.17 Theorem Suppose 0 < p < oo, / e H p , and f is not identically 0. Then 
log I/* I e the outer function 

f 1 f n e lt + z ) 

Q /(z) = exp j— I - log | /*(<?“) | dt\ (1) 

is in H p , an J t/iere is an inner function M f such that 

f=M f Q f . (2) 

Furthermore , 

log 1 /( 0 ) I ^ I* log I /VO I dt. (3) 

Equality holds in (3) if and only if M f is constant. 

The functions M f and Q f are called the inner and outer factors of f respec- 
tively; Q f depends only on the boundary values of | / 1 . 

Proof We assume first that f e H 1 . If B is the Blaschke product formed with 
the zeros of / and if g = f/B , Theorem 17.9 shows that g e H 1 ; and since 
I g* I = I /* I a.e. on T, it suffices to prove the theorem with g in place of f 
So let us assume that / has no zero in U and that /( 0) = 1. Then log |/| 
is harmonic in U , log |/(0)| = 0, and since log = log + — log - , the mean 
value property of harmonic functions implies that 

£ J* log" \f(re ie )\ M = 1°8 + I fire 1 *) dO < \\f\\ 0 < U/IU (4) 

for 0 < r < 1. It now follows from Fatou’s lemma that both log + | /* | and 
I /* I are in l}(T\ hence so is log | / * | . 

This shows that the definition (1) makes sense. By Theorem 17.16, Q f e 
H l . Also, | Q* | = | /* | 7 ^ 0 a.e., since log | /* | e l}(T). If we can prove that 

\f(z)\<\Qf(z)\ (zeU), (5) 

then f/Q f will be an inner function, and we obtain the factorization (2). 

Since log \Q f \ is the Poisson integral of log | /* | , (5) is equivalent to the 
inequality 


log|/|<P[log|/*|], (6) 

which we shall now prove. Our notation is as in Chap. 11: P\K] is the 
Poisson integral of the function h e l}(T). 

For | z | < 1 and 0 < R < 1, put f R (z) — f(Rz). Fix z e U. Then 


log | f R (z) I = P[log + I f R I ](z) - P[log- I f R I ](z). 


(7) 



//^-spaces 345 


Since |log + u — log + v\ < \u — v\ for all real numbers u and v , and since 
|| f R — /* Hi— >0 as P— ► 1 (Theorem 17.11), the the first Poisson integral in (7) 
converges to P[log + | /* | ], as R — ► 1. Hence Fatou’s lemma gives 

PUog- |/*|] < lim inf P[log- | f R | ] = P[log + | f* | ] - log | / 1 , (8) 

JR-1 

which is the same as (6). 

We have now established the factorization (2). If we put z — 0 in (5) we 
obtain (3); equality holds in (3) if and only if | /(0) | = | Q f ( 0) | , i.e., if and only 
if \M f (0)\ = 1; and since || Afy|| <*> = 1, this happens only when M f is a con- 
stant. 

This completes the proof for the case p = 1. 

If 1 < p < oo, then H p a H 1 , hence all that remains to be proved is that 
Q f e H p . But if / e H p , then |/* | e L P (T), by Fatou’s lemma; hence e H p , 
by Theorem 17.16(c). 

Theorem 17.10 reduces the case p < 1 to the case p = 2. //// 

The fact that log | /* | e l}(T) has a consequence which we have already used 
in the proof but which is important enough to be stated separately: 

17.18 Theorem I/O < p < oo,/e H p , and f is not identically 0, then at almost 
all points of T we have f*(e 10 ) # 0. 

Proof If f* = 0 then log | /* | = — oo, and if this happens on a set of posi- 
tive measure, then 



log|/V0| — oo. 


//// 


Observe that Theorem 17.18 imposes a quantitative restriction on the loca- 
tion of the zeros of the radial limits of an / e H p . Inside U the zeros are also 
quantitatively restricted, by the Blaschke condition. 

As usual, we can rephrase the above result about zeros as a uniqueness theo- 
rem: 

Iff e H p , g e H p , and f*(e i0 ) = g*(e ie ) on some subset of T whose Lebesgue 
measure is positive , then f(z) = g(z) for all z e U. 

17.19 Let us take a quick look at the class N, with the purpose of determining 
how much of Theorems 17.17 and 17.18 is true here. If / e N and / # 0, we can 
divide by a Blaschke product and get a quotient g which has no zero in U and 
which is in N (Theorem 17.9). Then log | g | is harmonic, and since 

I log \g \ I = 2 log + \g\ -log \g\ (1) 
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and 


^ I* log I g(re w ) | dO = log | 0 ( 0 ) | , ( 2 ) 

we see that log | g | satisfies the hypotheses of Theorem 11.30 and is therefore the 
Poisson integral of a real measure \i . Thus 

/(z) = cB(z) exp (3) 

where c is a constant, | c | = 1, and B is a Blaschke product. 

Observe how the assumption that the integrals of \og + \g\ are bounded 
(which is a quantitative formulation of the statement that \g \ does not get too 
close to oo) implies the boundedness of the integrals of log" | g | (which says that 
| g | does not get too close to 0 at too many places). 

If \i is a negative measure, the exponential factor in (3) is in J/ 00 . Apply the 
Jordan decomposition to p. This shows: 

To every f e N there correspond two functions and b 2 e H 00 such that b 2 has 

no zero in U and f = bjb 2 . 

Since b\ ^ 0 a.e., it follows that / has finite radial limits a.e. Also,/* ^ Oa.e. 

Is log |/* | e l}(T)l Yes, and the proof is identical to the one given in 
Theorem 17.17. 

However, the inequality (3) of Theorem 17.17 need no longer hold. For 
example, if 

/(z) = exp (4) 

then || /Ho = e, \f* \ = 1 a.e., and 

log | /( 0 ) I = 1 > 0 = fn f l° g ' r(e ‘ ,] 1 dt ' (5) 


The Shift Operator 

17.20 Invariant Subspaces Consider a bounded linear operator S on a Banach 
space X ; that is to say, S is a bounded linear transformation of X into X. If a 
closed subspace Y of X has the property that S(Y) c= Y, we call Y an S-invariant 
subspace . Thus the S-invariant subspaces of X are exactly those which are 
mapped into themselves by S. 

The knowledge of the invariant subspaces of an operator S helps us to visual- 
ize its action. (This is a very general — and hence rather vague — principle: In 
studying any transformation of any kind, it helps to know what the transform- 
ation leaves fixed.) For instance, if S is a linear operator on an n-dimensional 
vector space X and if S has n linearly independent characteristic vectors x u ..., 
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x„, the one-dimensional spaces spanned by any of these x f are 5-invariant, and 
we obtain a very simple description of 5 if we take {x 1? . . . , x„} as a basis of X. 

We shall describe the invariant subspaces of the so-called “ shift operator ” 5 
on / 2 . Here / 2 is the space of all complex sequences 



•} 

(1) 

for which 



V 

<N 

' rT 

s 

8 [^4? 

II 

00, 

(2) 

and S takes the element x e t 2 given by (1) to 



Sx = {0, £ 0 , 


(3) 


It is clear that 5 is a bounded linear operator on / 2 and that ||5|| = 1. 

A few 5-invariant subspaces are immediately apparent : If Y k is the set of all 
x e if 2 whose first k coordinates are 0, then Y k is 5-invariant. 

To find others we make use of a Hilbert space isomorphism between / 2 and 
H 2 which converts the shift operator 5 to a multiplication operator on H 2 . The 
point is that this multiplication operator is easier to analyze (because of the 
richer structure of H 2 as a space of holomorphic functions) than is the case in the 
original setting of the sequence space / 2 . 

We associate with each x e / 2 , given by (1), the function 

/(z)=££„z" (z e U). (4) 

n = 0 

By Theorem 17.12, this defines a linear one-to-one mapping of / 2 onto H 2 . If 

y = {fin}, 9{z) = Xl.z" (5) 

n = 0 

and if the inner product in H 2 is defined by 

(/, 9) = Y n J* / VVV 5 ) de, (6) 

the Parsevai theorem shows that (/, g) = (x, y). Thus we have a Hilbert space 
isomorphism of / 2 onto H 2 , and the shift operator 5 has turned into a multiplica- 
tion operator (which we still denote by 5) on H 2 : 

(5/)(z) = zf(z) (f e H 2 , z e U). (7) 

The previously mentioned invariant subspaces Y k are now seen to consist of 
all f e H 2 which have a zero of order at least k at the origin. This gives a clue: 
For any finite set {a 1? . .., <x k } c= U, the space Y of all f e H 2 such that /(a x ) = 

• • • = /(a k ) = 0 is 5-invariant. If B is the finite Blaschke product with zeros at a 1? 

. . . , a k , then / e Y if and only if f/B e H 2 . Thus Y = BH 2 . 
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This suggests that infinite Blaschke products may also give rise to 5-invariant 
subspaces and, more generally, that Blaschke products might be replaced by arbi- 
trary inner functions (p. It is not hard to see that each (pH 2 is a closed 5-invariant 
subspace of H 2 , but that every closed 5-invariant subspace of H 2 is of this form is 
a deeper result. 


17.21 Beurling’s Theorem 

(a) For each inner function q> the space 

<pH 2 = {<pf:fe H 2 } (1) 

is a closed S-invariant subspace of H 2 . 

(b) If (p Y and cp 2 are inner functions and if (p^H 2 = cp 2 H 2 , then (pj(p 2 is constant. 

(c) Every closed S-invariant subspace Y of H 2 , other than {0}, contains an inner 
function cp such that Y = (pH 2 . 

Proof H 2 is a Hilbert space, relative to the norm 

(2) 

If (p is an inner function, then | (p* | = 1 a.e. The mapping/—* (pf is there- 
fore an isometry of H 2 into H 2 ; being an isometry, its range (pH 2 is a closed 
subspace of H 2 . IProof : If (pf n ^g in H 2 , then .{<?/„} is a Cauchy sequence, 
hence so is {/„}, hence f n — >f e H 2 , so g = (pf e (pH 2 .'] The 5-invariance of 
(pH 2 is also trivial, since z • (pf = q> • zf. Hence (a) holds. 

If (p t H 2 = (p 2 H 2 , then (p 1 = (p 2 f for some / e H 2 , hence (pj(p 2 £ H 2 . 
Similarly, (p 2 /(p l e H 2 . Put q> = (pj(p 2 and h = q> 4- (1 /(p). Then h e H 2 , and 
since | (p* \ = 1 a.e. on T, h* is real a.e. on T. Since h is the Poisson integral of 
h*, it follows that h is real in U, hence h is constant. Then (p must be con- 
stant, and (b) is proved. 

The proof of (c) will use a method originated by Helson and Lowdensla- 
ger. Suppose Y is a closed 5-invariant subspace of H 2 which does not consist 
of 0 alone. Then there is a smallest integer k such that Y contains a function / 
of the form 


/(z)=lv, c k = 1. (3) 

n — k 

Then f$zY, where we write zY for the set of all g of the form 
g(z) = z/(z),/e Y. It follows that zY is a proper closed subspace of Y [closed 
by the argument used in the proof of (a)], so Y contains a nonzero vector 
which is orthogonal to zY (Theorem 4.11). 
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So there exists a cp e Y such that ||<p|| 2 = 1 and cp J_ zY. Then cp J_ z n cp 9 
for n = 1, 2, 3, .... By the definition of the inner product in H 2 [see 17.20(6)] 
this means that 


2 n 



d0 = 0 


(«= 1, 2, 3,...). 


(4) 


These equations are preserved if we replace the left sides by their complex 
conjugates, i.e., if we replace n by —n. Thus all Fourier coefficients of the 
function | q>* \ 2 e I}(T) are 0, except the one corresponding to n = 0, which is 
1. Since L 1 -functions are determined by their Fourier coefficients (Theorem 
5.15), it follows that \q>*\ = 1 a.e. on T. But (p e H 2 , so cp is the Poisson 
integral of cp* 9 and hence | cp \ < 1. We conclude that cp is an inner function. 

Since cp e Y and Y is 5-invariant, we have cpz n e Y for all n > 0, hence 
cpP e Y for every polynomial P. The polynomials are dense in H 2 (the partial 
sums of the power series of any / e H 2 converge to / in the H 2 - norm, by 
Parseval’s theorem), and since Y is closed and | cp \ < 1, it follows that 
cpH 2 a Y. We have to prove that this inclusion is not proper. Since cpH 2 is 
closed, it is enough to show that the assumptions h e Y and h _L cpH 2 imply 
h = 0. 

If h 1 cpH 2 9 then h 1 cpz n for n = 0, 1, 2, ... , or 


_ 1 _ 

2n 


f 


h*(e i0 )cp*(e i9 )e~ 


inO 


d0 = 0 


(« = 0 , 1 , 2 ,...). 


(5) 


If h e Y, then z n h e zY if n = 1, 2, 3, . . . , and our choice of cp shows that 
z n h _L cp 9 or 


2n 


r 


h*(e l0 )cp*(e l0 )e ln0 d6 = 0 


(n = -1, -2, -3,...). 


( 6 ) 


Thus all Fourier coefficients of h*cp* are 0, hence h*cp* = 0 a.e. on T ; and 
since | cp* \ = 1 a.e., we have h* = 0 a.e. Therefore h = 0, and the proof is 
complete. //// 


17.22 Remark If we combine Theorems 17.15 and 17.21, we see that the 
S-invariant subspaces of H 2 are characterized by the following data: a 
sequence of complex numbers {a„} (possibly finite, or even empty) such that 
l a J < 1 and Z(l-|aJ) < oo, and a positive Borel measure ix on T, singular 
with respect to Lebesgue measure (so D/x = 0 a.e.). It is easy (we leave this as 
an exercise) to find conditions, in terms of {a„} and ju, which ensure that one 
5-invariant subspace of H 2 contains another. The partially ordered set of all 
5-invariant subspaces is thus seen to have an extremely complicated struc- 
ture, much more complicated than one might have expected from the simple 
definition of the shift operator on / 2 . 

We conclude the section with an easy consequence of Theorem 17.21 
which depends on the factorization described in Theorem 17.17. 
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17.23 Theorem Suppose M f is the inner factor of a function f e H 2 , and Y is 
the smallest closed S-invariant subspace of H 2 which contains f Then 

Y = M f H 2 . (1) 

In particular , Y = H 2 if and only if f is an outer function. 

Proof Let / = M f Q f be the factorization of / into its inner and outer 
factors. It is clear that / e M f H 2 \ and since M f H 2 is closed and 5-invariant, 
we have Y c= M f H 2 . 

On the other hand, Theorem 17.21 shows that there is an inner function 
(p such that Y = (pH 2 . Since / e 7, there exists an h = M h Q h e H 2 such that 

M f Q f = cpM h Q h . (2) 

Since inner functions have absolute value 1 a.e. on T, (2) implies that Q f = 
Q h , hence M f = (pM h e 7, and therefore Y must contain the smallest 5- 
in variant closed subspace which contains M f . Thus M f H 2 a Y , and the 
proof is complete. //// 


It may be of interest to summarize these results in terms of two questions to 
which they furnish answers. 

If f e H 2 , which functions g e H 2 can be approximated in the H 2 - norm by 
functions of the form fP, where P runs through the polynomials? Answer: Pre- 
cisely those g for which g/M f e H 2 . 

For which / e H 2 is it true that the set {fP} is dense in H 2 1 Answer: Precisely 
for those / for which 


log 1/(0) | 


if: 


log I /VO | dt. 


Conjugate Functions 

17.24 Formulation of the Problem Every real harmonic function u in the unit disc 
U is the real part of one and only one f e H(U) such that/(0) = w(0). If f =u + iv , 
the last requirement can also be stated in the form t^O) = 0. The function v is 
called the harmonic conjugate of w, or the conjugate function of u. 

Suppose now that u satisfies 


sup IM, < CO (1) 

r< 1 

for some p. Does it follow that (1) holds then with v in place of w? 

Equivalently, does it follow that f e H P 1 

The answer (given by M. Riesz) is affirmative if 1 < p < oo. (For p = 1 and 
p = oo it is negative; see Exercise 24.) The precise statement is given by Theorem 
17.26. 
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Let us recall that every harmonic u that satisfies (1) is the Poisson integral of 
a function w* e B(T) (Theorem 11.30) if 1 < p < oo. Theorem 11.11 suggests 
therefore another restatement of the problem : 

if i < p < oo, and if we associate to each h e B(T) the holomorphic function 

(#X*) = L J n h(e“) dt (z e 17), (2) 

do all of these functions ij/h lie in H pC t 

Exercise 25 deals with some other aspects of this problem. 

17.25 Lemma // 1 < p < 2, d = 7i/(l + p), a = (cos <5) _1 , and P = a p (l + a), 
then 

1 < /?(cos </>) p - a cos pep ^ ^ < (p < 0 (1) 

Proof If S < \ cp \ < nf 2, then the right side of (1) is not less than 
— a cos pep > —cl cos pd = a cos <5=1, 
and it exceeds /?(cos <5) p — a = 1 if | cp \ < 3. //// 

17.26 Theorem // 1 < p < oo, t/ien tfeere is a constant A p < oo such that the 
inequality 

m\\ P <A P m P a) 

holds for every h e IF{T). 

More explicitly, the conclusion is that ij/h (defined in Sec. 17.24) is in H p , 
and that 

[\m)r\ p da<A p p [\h\ p do (0 < r < 1) (2) 

Jr Jr 

where do — dQ/2n is the normalized Lebesgue measure on T. 

Note that h is not required to be a real function in this theorem, which 
asserts that i//: L p — ► H p is a bounded linear operator. 

Proof Assume first that 1 < p < 2, that h e L P (T), h > 0, h # 0, and let u be 
the real part of / = ij/h. Formula 11.5(2) shows that u = P[h\ hence u > 0 in 
U. Since U is simply connected and /has no zero in 17, there is a g e H{U) 
such that g = / p , #(0) > 0. Also, u = | / 1 cos <p, where cp is a real function 
with domain U that satisfies | <p | < tt/2. 

If a = a p and P — P p are chosen as in Lemma 17.25, it follows that 

f |/ r P < 0 f (« r ) p da - a f |/ r | p cos (p(p r ) da (3) 

Jr Jr Jr 


for 0 < r < 1. 
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Note that | / \ p cos p<p = Re g. The mean value property of harmonic 
functions shows therefore that the last integral in (3) is equal to Re p(0) > 0. 
Hence 


jW</<7<0 £ 


h p do (0 < r < 1) 


(4) 


because u = P[/i] implies ||u r || p < ||Ji|| p . Thus 

mw P <p iip \\h\\ p ( 5 ) 

if h e B{T\ h>0. 

If h is an arbitrary (complex) function in L P (T), the preceding result 
applies to the positive and negative parts of the real and imaginary parts of h. 
This proves (2), for 1 < p < 2, with A p = 4/? 1/p . 

To complete the proof, consider the case 2 < p < oo. Let w e Z?(T), 
where q is the exponent conjugate to p. Put w(e w ) = w(e~ i& ). A simple compu- 
tation, using Fubini’s theorem, shows for any h e L?(T) that 

j ( il/h) r w dc 7=| (i //w) r h da (0 < r < 1). (6) 

Jr Jr 


Since q < 2, (2) holds with w and q in place of h and p, so that (6) leads to 


I 


(ij/h) r w da 




(7) 


Now let w range over the unit ball of I3(T) and take the supremum on the 
left side of (7). The result is 


'j i Ip 

J(\l/h)r\ p do> <A , 


\h\ p do 


i Ip 


(0 < r < 1). 


( 8 ) 


Hence (2) holds again, with A p < A q . 


Ill V 


(If we take the smallest admissible values for A p and A q , the last calculation 
can be reversed, and shows that A p = A q .) 


Exercises 

1 Prove Theorems 17.4 and 17.5 for upper semicontinuous subharmonic functions. 

2 Assume / e H(Q) and prove that log (1 + | / 1 ) is subharmonic in Cl 

3 Suppose 0 < p < oo and / e H(U). Prove that / e H p if and only if there is a harmonic function u in 
U such that |/(z)| p < u(z) for all z e U. Prove that if there is one such harmonic majorant u of |/| p , 
then there is at least one, say u f . (Explicitly, |/| p < u f and u f is harmonic; and if |/| p < u and u is 
harmonic, then u f < u .) Prove that ||/|| p = u/Q) llp . Hint : Consider the harmonic functions in D{ 0; R ), 
R < 1, with boundary values |/| p , and let R—*l. 

4 Prove likewise that / e N if and only if log + | / 1 has a harmonic majorant in U. 
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5 Suppose / e H p , (p e H((/), and <p((/) c t/. Does it follow that /° (p e H pe ! Answer the same ques- 
tion with N in place of H p . 

6 IfO <r < s < oo, show that H s is a proper subclass of H r . 

7 Show that H 00 is a proper subclass of the intersection of all H p with p < oo. 

8 If/e H 1 and/* e L P (T), prove that/e H p . 

9 Suppose / e H((7) and /((/) is not dense in the plane. Prove that / has finite radial limits at almost 
all points of T. 

10 Fix a e U. Prove that the mapping/— ►/(<*) is a bounded linear functional on H 2 . Since H 2 is a 
Hilbert space, this functional can be represented as an inner product with some g e H 2 . Find this g. 

11 Fix a e U. How large can |/'(a)| be if ||/|| 2 < 1? Find the extremal functions. Do the same for 

/ (B H 

12 Suppose p > 1 ,/ e H p , and /* is real a.e. on T. Prove that /is then constant. Show that this result 
is false for every p < 1. 

13 Suppose / e H(U), and suppose there exists an M < oo such that / maps every circle of radius 
r < 1 and center 0 onto a curve y r whose length is at most M. Prove that / has a continuous extension 
to U and that the restriction off to T is absolutely continuous. 

14 Suppose p is a complex Borel measure on T such that 

| e iM dfj{t) = 0 (n= 1,2,3,...). 

Prove that then either p = 0 or the support of p is all of T. 

15 Suppose K is a proper compact subset of the unit circle T. Prove that every continuous function 
on K can be uniformly approximated on K by polynomials. Hint : Use Exercise 14. 

16 Complete the proof of Theorem 17.17 for the case 0 < p < 1. 

17 Let (p be a nonconstant inner function with no zero in U. 

(a) Prove that \/q> $ H p if p > 0. 

(b) Prove that there is at least one e ie e T such that lim r _ 1 ^(rc w ) = 0. 

Hint : log | (p \ is a negative harmonic function. 

18 Suppose q> is a nonconstant inner function, |a| < 1, and a £ (p{U ). Prove that lim r _ >1 <p(re ,e ) = a for 
at least one e ie e T. 

19 Suppose / e H 1 and 1 /f e H l . Prove that /is then an outer function. 

20 Suppose / e H 1 and Re [/(z)] > 0 for all z e U. Prove that /is an outer function. 

21 Prove that / e N if and only iff — g/h , where g and h e H 00 and h has no zero in U. 

22 Prove the following converse of Theorem 1 5.24 : 

If/e H{U) and if 


lim I | log | / (re ie ) \ \ dO = 0, (*) 

r~* 1 J-n 

then / is a Blaschke product. Hint : (*) implies 

lim J* log + | /(re") | dd = 0. 

Since log + |/| >0, it follows from Theorems 17.3 and 17.5 that log + |/| = 0, so |/| < 1. Now 
/ = Bg , g has no zeros, \g \ < 1, and (*) holds with 1/g in place of /. By the first argument, 1 1/g \ < 1. 
Hence \g \ = 1. 

23 Find the conditions mentioned in Sec. 17.22. 
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24 The conformal mapping of U onto a vertical strip shows that M. Riesz’s theorem on conjugate 
functions cannot be extended to p = oo. Deduce that it cannot be extended to p = 1 either. 

25 Suppose 1 < p < oo, and associate with each / e E{T) its Fourier coefficients 

f{n) = / | f(e“)e- im dt (n = 0, ±1, + 2, . . .). 

2jt J-« 

Deduce the following statements from Theorem 17.26: 

(u) To each / e E(T) there corresponds a function g e E{T) such that g(n ) =f(n) for n > 0 but 
g{n) = 0 for all n < 0. In fact, there is a constant C, depending only on p, such that 

\\g\\ p <c\\f\\ p . 

The mapping/— ► g is thus a bounded linear projection of E(T) into E(T). The Fourier series of g is 
obtained from that of /by deleting the terms with n < 0. 

(b) Show that the same is true if we delete the terms with n < k, where k is any given integer. 

(c) Deduce from ( b ) that the partial sums s n of the Fourier series of any / e E{T) form a bounded 
sequence in E(T). Conclude further that we actually have 

lim II/— sj|p = 0. 


(d) If/e E{T) and if 


F(z) = £ / (n)z” , 

n = 0 

then F e H p , and every F e H p is so obtained. Thus the projection mentioned in (a) may be regarded 
as a mapping of E{T) onto H p . 

26 Show that there is a much simpler proof of Theorem 17.26 if p = 2, and find the best value of A 2 . 

27 Suppose /(z) = a n z" in U and ^ I <* n | c °o. Prove that 


J> 


\f'(re i0 )\ dr < oo 


for all 0. 

28 Prove that the following statements are correct if {n k } is a sequence of positive integers which 
tends to oo sufficiently rapidly. If 


00 z"* 

/<*>- I j 

k= l K 


then | f\z ) | > nfj(10k) for all z such that 


1 1 

1 < |z| < 1 . 

n b 2n * 


Hence 


j> 


\f(re ,e )\dr = oo 


lim 

K-»l 


f 


/'(re“) *■ 


for every 0, although 
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exists (and is finite) for almost all 6. Interpret this geometrically, in terms of the lengths of the images 
under / of the radii in U. 

29 Use Theorem 17.11 to obtain the following characterization of the boundary values of H p - 
functions, for 1 < p < oo : 

A function g e IF(T) is/* (a.e.) for some / e H p if and only if 

^ J' A = 0 

for all negative integers n. 



CHAPTER 

EIGHTEEN 


ELEMENTARY THEORY OF 
BANACH ALGEBRAS 


Introduction 

18.1 Definitions A complex algebra is a vector space A over the complex field 
in which an associative and distributive multiplication is defined, i.e., 

x(yz) = (xy)z, (x + y)z = xz 4* yz, x(y + z) = xy + xz (1) 

for x, y, and z e A 9 and which is related to scalar multiplication so that 

a(xy) = x(xy) = (<xx)y (2) 


for x and y e A, a a scalar. 

If there is a norm defined in A which makes A into a normed linear space 
and which satisfies the multiplicative inequality 

\\xy\\ < IMIllyll (x and y e A\ (3) 

then A is a normed complex algebra. If, in addition, A is a complete metric 
space relative to this norm, i.e., if A is a Banach space, then we call A a 
Banach algebra. 

The inequality (3) makes multiplication a continuous operation. This 
means that if x„— ► x and y n —> y 9 then x n y„— ► xy, which follows from (3) and 
the identity 


X„ y„-xy = (x n - x)y„ + x(y n - y). (4) 

Note that we have not required that A be commutative, i.e., that xy = yx 
for all x and y e A 9 and we shall not do so except when explicitly stated. 


356 
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However, we shall assume that A has a unit. This is an element e such 

that 


xe = ex = x (x e A). (5) 

It is easily seen that there is at most one such e (e f = e'e = e) and that 
\\e\\ > 1, by (3). We shall make the additional assumption that 

Ml = 1. (6) 

An element x e A will be called invertible if x has an inverse in A , i.e., if 
there exists an element x~ l e A such that 

x~ l x = xx~ l = e. (7) 

Again, it is easily seen that no x e A has more than one inverse. 

If x and y are invertible in A , so are x _1 and xy, since (xy) _1 == y _1 x _1 . 
The invertible elements therefore form a group with respect to multiplication. 

The spectrum of an element x e A is the set of all complex numbers k 
such that x — ke is not invertible. We shall denote the spectrum of x by <r(x). 

18.2 The theory of Banach algebras contains a great deal of interplay between 
algebraic properties on the one hand and topological ones on the other. We 
already saw an example of this in Theorem 9.21, and shall see others. There 
are also close relations between Banach algebras and holomorphic functions: 
The easiest proof of the fundamental fact that o{x) is never empty depends on 
Liouville’s theorem concerning entire functions, and the spectral radius formula 
follows naturally from theorems about power series. This is one reason for re- 
stricting our attention to complex Banach algebras. The theory of real Banach 
algebras (we omit the definition, which should be obvious) is not so satisfactory. 

The Invertible Elements 

In this section, A will be a complex Banach algebra with unit e , and G will be the 
set of all invertible elements of A. 

18.3 Theorem If x e A and ||x|| < 1, then e + x e G, 

(e + x)- 1 = £(-l)"x", (1) 

n = 0 

and 


||(e + x) 1 - e + x|| < . (2) 

1 - Wl 

Proof The multiplicative inequality 18.1(3) shows that ||x w || < ||x|| w . If 
s N = e — x + x 2 — • • • + (— l) N x N , 


( 3 ) 
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it follows that { s N } is a Cauchy sequence in A , hence the series in (1) con- 
verges (with respect to the norm of A) to an element y e A. Since multiplica- 
tion is continuous and 


(e + x)s N = e + (- l) N x N+1 = s^e + x), 
we see that (e + x)y = e = y(e + x). This gives (1), and (2) follows from 

L(-i)v 

n-2 


< I II*" 

n — 2 


^ I 


* = 


1 - llxll 


(4) 

(5) 

llll 


18.4 Theorem Suppose x g G, ||x i \\ = 1/a, h e A, and \\h\\ = /? < a. T/ien 
x + h g G, and 


||(x + /i)" 1 - x' 1 + 


< 


/? 2 

a 2 (a-«* 


( 1 ) 


Proof ||x 1 h|| < )9/a < 1, hence e + x g G, by Theorem 18.3; and since 
x + h = x(e + x" 1 h), we have x + /igG and 

(x + h)" 1 =(e + x" 1 /i)" 1 x" 1 . (2) 

Thus 

(x + h)~ l — x _1 + x~ l hx~ l = [(e + x~ 1 h)~ 1 — e + x _ 1 /i]x _1 , (3) 

and the inequality (1) follows from Theorem 18.3, with x~ l h in place of x. //// 

Corollary 1 G is an open set , and t/ie mapping x— ► x -1 is a homeomorphism of 
G onto G. 

For if x g G and ||h||-*0, (1) implies that ||(x + h)~ l — x _1 || — ► 0. Thus 
x^ x _1 is continuous; it clearly maps G onto G, and since it is its own inverse, it 
is a homeomorphism. 

Corollary 2 The mapping x— ► x - 1 is differentiable. Its differential at any 
x g G is the linear operator which takes h e A to —x~ l hx~ 1 . 

This can also be read off from (1). Note that the notion of the differential of a 
transformation makes sense in any normed linear space, not just in R k , as in 
Definition 7.22. If A is commutative, the above differential takes h to — x" 2 /i, 
which agrees with the fact that the derivative of the holomorphic function z -1 is 


Corollary 3 For every x e A, cr(x) is compact , and |2| < ||x|| if X e cr(x). 
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For if | X | > 11*11, then e — X~ 1 x e G, by Theorem 18.3, and the same is true 
of * — Xe = — X(e — X~ l x); hence X $ o{x). To prove that <x(x) is closed, observe 
(a) X g a(x) if and only if * — Xe $ G; (b) the complement of G is a closed subset of 
A, by Corollary 1; and (c) the mapping X— > x — Xe is a continuous mapping of 
the complex plane into A. 

18.5 Theorem Let Q> be a bounded linear functional on A, fix x e A, and define 
f(k) = <t[(x - Xe) ~ *] (A $ <x(x)). (1) 

Then f is holomorphic in the complement of (fix), andf(X )— ► 0 as X—* oo. 

Proof Fix X <r(x) and apply Theorem 18.4 with * — Xe in place of * and 
with (X — p)e in place of h. We see that there is a constant C, depending on * 
and X, such that 

||(* - pe)' 1 - (x - Xe)" 1 +(X- p)fx - Xe)~ 2 \\ <C\p- X\ 2 (2) 

for all p which are close enough to X. Thus 

(*-'*)-' (3) 

P A 

as /a — > X, and if we apply O to both sides of (3), the continuity and linearity of 
O show that 


m -m 

p — X 


<!>[(* — Ae)“ 2 ]. 


(4) 


So / is differentiable and hence holomorphic outside o(x). Finally, as 2— ► oo 
we have 


m) = 4>Mx - Xe ) ~ »] = -e) ^<t>(- e ), 

by the continuity of the inversion mapping in G. 

18.6 Theorem For every x e A, o(x) is compact and not empty . 


(5) 

//// 


Proof We already know that o(x) is compact. Fix x e A, and fix X 0 $ <r(x). 
Then (* — X 0 e)~ l ^0, and the Hahn-Banach theorem implies the existence 
of a bounded linear functional <I> on A such that f(X 0 ) ^ 0, where /is defined 
as in Theorem 18.5. If o(x) were empty, Theorem 18.5 would imply that /is 
an entire function which tends to 0 at oo, hence f(X) = 0 for every X, by 
Liouville’s theorem, and this contradicts f(X 0 ) ^ 0. So o(x) is not empty. //// 


18.7 Theorem (Gelfand-Mazur) If A is a complex Banach algebra with unit in 
which each nonzero element is invertible , then A is ( isometrically isomorphic to) 
the complex field. 
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An algebra in which each nonzero element is invertible is called a division 
algebra . Note that the commutativity of A is not part of the hypothesis; it is part 
of the conclusion. 

Proof If x e A and ^ X 2 > at least one of the elements x — X±e and 
x — X 2 e must be invertible, since they cannot both be 0. It now follows from 
Theorem 18.6 that <r(x) consists of exactly one point, say 2(x), for each x e A. 
Since x — X(x)e is not invertible, it must be 0, hence x = X(x)e. The mapping 
x-*2(x) is therefore an isomorphism of A onto the complex field, which is 
also an isometry, since | 2(x)| = ||2(x)e|| = ||x|| for all x e A. //// 

18.8 Definition For any x e A, the spectral radius p(x) of x is the radius of 
the smallest closed disc with center at the origin which contains <r(x) 
(sometimes this is also called the spectral norm of x; see Exercise 14): 

p(x) = sup { \X \ : X e <r(x)}. 


18.9 Theorem (Spectral Radius Formula) For every x e A, 

lim \\x n \\ l/n = p(x). (1) 

n~* oo 

( This existence of the limit is part of the conclusion.) 

Proof Fix x e A, let n be a positive integer, X a complex number, and 
assume X n 4 <r(x w ). We have 

(x" - X n e) = (x - Xe)(x n ~ 1 + Xx n ~ 2 + • • • + X n ~ 1 e). (2) 

Multiply both sides of (2) by (x w — X n e)~ l . This shows that x — Xe is invert- 
ible, hence X $ <r(x). 

So if X e < t ( x ), then X n e <r(x n ) for n = 1, 2, 3, ... . Corollary 3 to Theorem 
18.4 shows that \X n \ < ||x w ||, and therefore \X \ < ||x w || 1/w . This gives 

p(x) < lim inf ||x w || 1/w . (3) 

n~> oo 

Now if \X \ > ||x||, it is easy to verify that 

(Xe — x) £ X~ n ~ 1 x n = e. (4) 

n = o 

The above series is therefore — (x — Xe)~ l . Let O be a bounded linear func- 
tional on A and define / as in Theorem 18.5. By (4), the expansion 

m = - £ 

n = 0 


( 5 ) 
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is valid for all X such that \X\ > ||x||. By Theorem 18.5,/ is holomorphic 
outside <x(x), hence in the set {X: \X\ > p(x)}. It follows that the power series 
(5) converges if \X \ > p(x). In particular, 


sup < 00 ( | A | > p(x)) 

n 

(6) 

for every bounded linear functional O on A. 

It is a consequence of the Hahn-Banach theorem (Sec. 5.21) that the 
norm of any element of A is the same as its norm as a linear functional on 
the dual space of A. Since (6) holds for every O, we can now apply the 
Banach-Steinhaus theorem and conclude that to each X with \X \ > p(x) there 
corresponds a real number C{X) such that 

p-V|| < C(A) (n = 1,2,3,...). 

(7) 

Multiply (7) by | A |" and take nth roots. This gives 


||x"|| 1/ ’’ < | A| [C(A)] 1/B (» = 1,2,3,...) 

(8) 

if | A | > p(x), and hence 


lim sup ||x"|| 1/ * < p(x). 

n~* oo 

(9) 

The theorem follows from (3) and (9). 

llll 


18.10 Remarks 

(a) Whether an element of A is or is not invertible in A is a purely algebraic 
property. Thus the spectrum of x, and likewise the spectral radius p(x), 
are defined in terms of the algebraic structure of A , regardless of any 
metric (or topological) considerations. The limit in the statement of 
Theorem 18.9, on the other hand, depends on metric properties of A. 
This is one of the remarkable features of the theorem: It asserts the 
equality of two quantities which arise in entirely different ways. 

(b) Our algebra may be a subalgebra of a larger Banach algebra B (an 
example follows), and then it may very well happen that some x e A is 
not invertible in A but is invertible in B. The spectrum of x therefore 
depends on the algebra; using the obvious notation, we have o A (x) => 
<r B (x), and the inclusion may be proper. The spectral radius of x, however, 
is unaffected by this, since Theorem 18.9 shows that it can be expressed 
in terms of metric properties of powers of x, and these are independent of 
anything that happens outside A. 

18.11 Example Let C(T) be the algebra of all continuous complex functions 
on the unit circle T (with pointwise addition and multiplication and the 
supremum norm), and let A be the set of all f e C(T) which can be extended 
to a continuous function F on the closure of the unit disc U , such that F is 
holomorphic in JJ. It is easily seen that A is a subalgebra of C(T). If/, e A 
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and {/„} converges uniformly on T, the maximum modulus theorem forces 
the associated sequence { F n } to converge uniformly on the closure of U. This 
shows that A is a closed subalgebra of C(T), and so A is itself a Banach 
algebra. 

Define the function f 0 by f 0 (e w ) = e w . Then F 0 (z) = z. The spectrum of f 0 
as an element of A consists of the closed unit disc; with respect to C(T), the 
spectrum of f 0 consists only of the unit circle. In accordance with Theorem 
18.9, the two spectral radii coincide. 

Ideals and Homomorphisms 

From now on we shall deal only with commutative algebras. 

18.12 Definition A subset / of a commutative complex algebra A is said to be 
an ideal if (a) I is a subspace of A (in the vector space sense) and (b) xy e / 
whenever x e A and y e /. If / # A, I is a proper ideal. Maximal ideals are 
proper ideals which are not contained in any larger proper ideals. Note that 
no proper ideal contains an invertible element. 

If B is another complex algebra, a mapping cp of A into B is called a 
homomorphism if cp is a linear mapping which also preserves multiplication: 
(p(x)cp(y) = cp(xy) for all x and y e A. The kernel (or null space) of cp is the set 
of all x e A such that <p(x) = 0. It is trivial to verify that the kernel of a 
homomorphism is an ideal. For the converse, see Sec. 18.14. 

18.13 Theorem If A is a commutative complex algebra with unit , every proper 
ideal of A is contained in a maximal ideal If in addition , A is a Banach 
algebra , every maximal ideal of A is closed. 

Proof The first part is an almost immediate consequence of the Hausdorff 
maximality principle (and holds in any commutative ring with unit). Let / be 
a proper ideal of A. Partially order the collection & of all proper ideals of A 
which contain / (by set inclusion), and let M be the union of the ideals in 
some maximal linearly ordered subcollection i? of £P. Then M is an ideal 
(being the union of a linearly ordered collection of ideals), / M, and M ^ 
A, since no member of & contains the unit of A. The maximality of i? implies 
that M is a maximal ideal of A. 

If A is a Banach algebra, the closure M of M is also an ideal (we leave 
the details of the proof of this statement to the reader). Since M contains no 
invertible element of A and since the set of all invertible elements is open, we 
have M =/= A, and the maximality of M therefore shows that M = M. //// 

18.14 Quotient Spaces and Quotient Algebras Suppose J is a subspace of a vector 

space A , and associate with each x e A the coset 


(p(x) = x + J = {x + y: y e J}. 


( 1 ) 
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If x t — x 2 e J , then cpixj = <p(x 2 ). If x t — x 2 $ J , (p(x t ) n <p(x 2 ) = 0. The set of 
all cosets of J is denoted by A/J; it is a vector space if we define 

(p{x) + (p(y) = cp(x + _y), X(p(x) = <p(Xx) (2) 

for x and y e A and scalars X. Since J is a vector space, the operations (2) are well 
defined; this means that if <p(x) = <p(x') and cp(y) = (p{y'\ then 

cp(x) + cp{y) = <p(x') + (p(y'), (3) 

Also, cp is clearly a linear mapping of A onto A/J ; the zero element of A/J is 
<p( 0) = J. 

Suppose next that A is not merely a vector space but a commutative algebra 
and that J is a proper ideal of A. If x' — x e J and y' — y e J, the identity 

x'y' -xy = (x' - x)y' + x(y' - y) (4) 

shows that x'y' — xy e J. Therefore multiplication can be defined in A/J in a 
consistent manner: 

cp{x)(p(y) = (p{xy) (x and y e A). (5) 

It is then easily verified that A/J is an algebra, and cp is a homomorphism of A 
onto A/J whose kernel is J. 

If A has a unit element e , then cp(e ) is the unit of A/J, and A/J is a field if and 
only if J is a maximaUieal. 

To see this, suppose x e A and x £ J, and put 

/ = {ax + y: a e A, y e J}. (6) 

Then / is an ideal in A which contains J properly, since x e /. If J is maximal, 
I = A, hence ax + y = e for some a e A and y e J, hence (p(a)q>(x) = <p(e); and 
this says that every nonzero element of A/J is invertible, so that A/J is a field. If J 
is not maximal, we can choose x as above so that / ^ A, hence e $ /, and then 
<p(x) is not invertible in A/J. 

18.15 Quotient Norms Suppose A is a normed linear space, J is a closed subspace 
of A , and (p{x ) = x + J, as above. Define 

||<p(x)|| = inf {||x + y|| : y e J}. (1) 

Note that ||<p(x)|| is the greatest lower bound of the norms of those elements 
which lie in the coset <p(x); this is the same as the distance from x to J. We call 
the norm defined in A/J by (1) the quotient norm of A/J. It has the following 
properties : 

(a) A/J is a normed linear space. 

(b) If A is a Banach space , so is A/J. 

(c) If A is a commutative Banach algebra and J is a proper closed ideal , then A/J 
is a commutative Banach algebra. 

These are easily verified: 



364 REAL AND COMPLEX ANALYSIS 


If x e J, ||<p(x)|| = 0. If x $ J, the fact that J is closed implies that ||<p(x)|| > 0. 
It is clear that ||2<p(x)|| — \X\ ||<p(x)||. If x x and x 2 e A and e > 0, there exist y x 
and y 2 e J so that 

Ik- + >’,•11 < ll<p(X|)|| +€ (/=1,2). (2) 

Hence 

+ x 2 )|| < II x 2 + X 2 + + y 2 || < ||</>(x 1 )|| + ||^>(x 2 )|| + 2c, (3) 

which gives the triangle inequality and proves (a). 

Suppose A is complete and {(p(x n )} is a Cauchy sequence in A/J. There is a 
subsequence for which 

\\<p(x„) - (p(x ni+1 )\\ <2~ l (i= 1,2,3,...), (4) 

and there exist elements z t so that z { — x n . e J and || z f — z i+1 || < 2~'\ Thus {zj is 
a Cauchy sequence in A ; and since A is complete, there exists z e A such that 
|| z f — z\\ — > 0. It follows that cp(x n .) converges to cp(z ) in A/J. But if a Cauchy 
sequence has a convergent subsequence, then the full sequence converges. Thus 
A/J is complete, and we have proved (b). 

To prove (c), choose x x and x 2 e A and e > 0, and choose y t and y 2 e J so 
that (2) holds. Note that (x x + y 1 )(x 2 + y 2 ) e x 1 x 2 + J 9 so that 

M*iX 2 )II < II(X! + yiXx 2 + >» 2 )|| ^ ||x x + y 1 ||||x 2 + y 2 ||. (5) 

Now (2) implies 

ll<p(x 1 x 2 )ll < ll^(x 1 )||||^(x 2 )||. (6) 

Finally, if e is the unit element of A , take x 1 ^ J and x 2 = e in (6); this gives 
\\(p(e)\\ > 1. But e e cp(e), and the definition of the quotient norm shows that 
\\q>(e)\\ < \\e\\ — 1. So \\(p(e)\\ = 1, and the proof is complete. 

18.16 Having dealt with these preliminaries, we are now in a position to derive 
some of the key facts concerning commutative Banach algebras. 

Suppose, as before, that A is a commutative complex Banach algebra with 
unit element e. We associate with A the set A of all complex homomorphisms of 
A; these are the homomorphisms of A onto the complex field, or, in different 
terminology, the multiplicative linear functionals on A which are not identically 0. 
As before, <t(x) denotes the spectrum of the element x e A, and p(x) is the spectral 
radius of x. 

Then the following relations hold: 

18.17 Theorem 

(a) Every maximal ideal M of A is the kernel of some h e A. 

(b) X e <t(x) if and only if h(x) = X for some h e A. 

(c) x is invertible in A if and only if h(x) # 0 for every he A. 

(d) h(x) e a(x) for every x e A and h e A. 

{e) | h(x) | < p(x) < ||x|| for every x e A and he A. 
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Proof If M is a maximal ideal of A , then A/M is a field; and since M is 
closed (Theorem 18.13), A/M is a Banach algebra. By Theorem 18.7 there is 
an isomorphism j of A/M onto the complex field. If h = j © <p, where cp is the 
homomorphism of A onto A/M whose kernel is M, then h e A and the kernel 
of h is M. This proves (a). 

If X g <r(x), then x — Xe is not invertible; hence the set of all elements 
(x — Xe)y, where y e A, is a proper ideal of A , which lies in a maximal ideal 
(by Theorem 18.13), and (a) shows that there exists an h e A such that 
h(x — Xe) = 0. Since h(e) = 1, this gives h(x) = X. 

On the other hand, if X $ <r(x), then (x — Xe)y = e for some y e A. It 
follows that h(x — Xe)h(y) = 1 for every /ieA, so that h(x — Xe) ^ 0, or 
h(x) t * X. This proves ( b ). 

Since x is invertible if and only if 0 4 <?(*)> (c) follows from (b). 

Finally, (d) and ( e ) are immediate consequences of ( b ). //// 


Note that ( e ) implies that the norm of h, as a linear functional, is at most 1. In 
particular, each h e A is continuous. This was already proved earlier (Theorem 
9.21). 


Applications 

We now give some examples of theorems whose statements involve no algebraic 
concepts but which can be proved by Banach algebra techniques. 

18.18 Theorem Let A(U) be the set of all continuous functions on the closure U 
of the open unit disc U whose restrictions to U are holomorphic. Suppose f u . . . , 
f n are members of A(U), such that 

I /l( z ) I + " ' + I fn( z ) I > 0 (1) 

for every z e U. Then there exist g u . . . , g n e A(U) such that 

t ft(z)g,{z) =1 (z e 0). (2) 

i — 1 


Proof Since sums, products, and uniform limits of holomorphic functions 
are holomorphic, A(U) is a Banach algebra, with the supremum norm. The 
set J of all functions lfig i9 where the g t are arbitrary members of A(U ), is an 
ideal of A(U). We have to prove that J contains the unit element 1 of A(U). 
By Theorem 18.13 this happens if and only if J lies in no maximal ideal of 
A(U). By Theorem 18.17(a) it is therefore enough to prove that there is no 
homomorphism h of A(U) onto the complex field such that h(f) = 0 for every 
i (1 < i < n). 
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Before we determine these homomorphisms, let us note that the poly- 
nomials form a dense subset of A(U). To see this, suppose / e >1(17) and € > 0; 
since / is uniformly continuous on U , there exists an r < 1 such that 
| f (z) — f(rz) | < € for all z e U; the expansion of f(rz) in powers of z con- 
verges if | rz | < 1, hence converges to f(rz) uniformly for z e U, and this gives 
the desired approximation. 

Now let h be a complex homomorphism of >1(17). Put / 0 (z) = z. Then 
/ 0 e >1(17). It is obvious that a(f 0 ) = U. By Theorem 18.17(d) there exists an 
a e U such that h(f 0 ) = a. Hence h(f n 0 ) = a" = /o(a), for n = 1, 2, 3, ..., so 
/i(P) = P(a) for every polynomial P. Since h is continuous and since the poly- 
nomials are dense in >1(17), it follows that h(f) = /(a) for every / e >1(17). 

Our hypothesis (1) implies that \f{oc)\>0 for at least one index i, 
1 < i < n. Thus h(fi) ^ 0. 

We have proved that to each h e A there corresponds at least one of the 
given functions / f such that h(f t ) ^ 0, and this, as we noted above, is enough 
to prove the theorem. //// 

Note : We have also determined all maximal ideals of A(U), in the course of 
the preceding proof, since each is the kernel of some h e A: // a e U and if M a is 
the set of all f e A(U) such that /(a) = 0, then M a is a maximal ideal of A(U ), and 
all maximal ideals of A(U) are obtained in this way . 

A(U) is often called the disc algebra. 

18.19 The restrictions of the members of >1(17) to the unit circle T form a closed 
subalgebra of C(T). This is the algebra >1 discussed in Example 18.11. In fact, >1 is 
a maximal subalgebra of C(T). More explicitly, if A a B a C{T) and B is a closed 
(relative to the supremum norm) subalgebra of C(T), then either B = A or B = C(T). 

It is easy to see (compare with Exercise 29, Chap. 17) that >1 consists pre- 
cisely of those f e C(T) for which 

f(n) = 7- J” 1 f(e ie )e- M id = 0 (n= -1, -2, -3,...). (1) 

Hence the above-mentioned maximality theorem can be stated as an approx- 
imation theorem: 

18.20 Theorem Suppose g e C(T) and g(n) ^ 0 for some n < 0. Then to every 
f e C(T) and to every € > 0 there correspond polynomials 

m(n) 

P n (e w )= ^a n:k e ike (n = 0,...,N) (1) 

k = 0 


such that 


!/(>*)- i Pn(e ie )g n (e i0 )\ < e 


B = 0 


(e w e T). 


(2) 
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Proof Let B be the closure in C = C(T) of the set of all functions of the form 

iPnd"- ( 3 ) 

n = 0 

The theorem asserts that B = C. Let us assume B ^ C. 

The set of all functions (3) (note that N is not fixed) is a complex algebra. 
Its closure B is a Banach algebra which contains the function / 0 , where 
fo(e iB ) = e 10 . Our assumption that B ^ C implies that l// 0 £ B , for otherwise B 
would contain /o for all integers n, hence all trigonometric polynomials 
would be in B; and since the trigonometric polynomials are dense in C 
(Theorem 4.25) we should have B = C. 

So / 0 is not invertible in B. By Theorem 18.17 there is a complex homo- 
morphism h of B such that h(f 0 ) = 0. Every homomorphism onto the 
complex field satisfies h(l) = 1 ; and since h(f 0 ) = 0, we also have 


h(f n o ) = lh(f o )T = 0 (n= 1,2,3,...). (4) 


We know that h is a linear functional on B , of norm at most 1. The 
Hahn-Banach theorem extends h to a linear functional on C (still denoted by 
h) of the same norm. Since h(l) = 1 and ||h|| < 1, the argument used in Sec. 
5.22 shows that h is a posi tive l inear functional on C. In particular, h(f) is 
real for real /; hence fr(/) = h{f). Since fo n is the complex conjugate of /J, it 
follows that (4) also holds for n = — 1, —2, — 3, . . . . Thus 


Hfo) = 


if n = 0, 
if n ± 0. 


( 5 ) 


Since the trigonometric polynomials are dense in C, there is only one 
bounded linear functional on C which satisfies (5). Hence h is given by the 
formula 


(. feC ). (6) 

Now if n is a positive integer, gf n 0 e B; and since h is multiplicative on B, 
(6) gives 

d{ = h f g{e ‘ e)ei " 6 de = h{gfl) = h{9)h{f " o) = °* (7) 

by (5). This contradicts the hypothesis of the theorem. //// 

We conclude with a theorem due to Wiener. 

18.21 Theorem Suppose 

f(e m ) = z c n e ine , z k.l < 00, 


( 1 ) 
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and f(e l °) 0 for every real 0. Then 

J^iej = _Z y» eM with Z I TJ < 00. 


( 2 ) 


Proof We let A be the space of all complex functions / on the unit circle 
which satisfy (1), with the norm 


11/11= I IcJ. (3) 

— oo 

It is clear that A is a Banach space. In fact, A is isometrically isomorphic to 
Z 1 , the space of all complex functions on the integers which are integrable 
with respect to the counting measure. But A is also a commutative Banach 
algebra, under pointwise multiplication. For if g e A and g(e ie ) = T,b n e in0 9 
then 


f(e w )9(e m ) = £ (l 

and hence 


( 4 ) 


ll/^ll = z 

n 


Y C n-k^k 

k 


<Y\ b k\Y\Cn-k\ = ll/ll ‘ llffll. 


( 5 ) 


Also, the function 1 is the unit of A , and || 1 1| = 1. 

Put f 0 (e ie ) = e i0 9 as before. Then / 0 e A , l// 0 e A , and ||/ol| = 1 for n = 0, 
±1, ±2, .... If ft is any complex homomorphism of A and h(f 0 ) = A, the fact 
that \\h\\ < 1 implies that 


m = l Kfo ) I < ll/SII = 1 (n = o, ± 1, ±2, . . .). (6) 

Hence |A| = 1. In other words, to each h corresponds a point e ia e T such 
that h(f 0 ) = e ia 9 so 

Kfo) = =f n o(e ia ) (n = 0, ± 1, ± 2, . . .). (7) 

If /is given by (1), then / = £c„/ o- This series converges in ,4; and since 
h is a continuous linear functional on A 9 we conclude from (7) that 

h(f)=f(e ia ) (feA). (8) 


Our hypothesis that / vanishes at no point of T says therefore that / is 
not in the kernel of any complex homomorphism of A 9 and now Theorem 
18.17 implies that /is invertible in A. But this is precisely what the theorem 
asserts. I HI 
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Exercises 

1 Suppose B(X ) is the algebra of all bounded linear operators on the Banach space X , with 

11/4x11 

Mi + A 2 ){x) = A x x + A 2 x , MM 2 Xx) = A 1 (A 2 x), MU = sup , 


if A, A v and A 2 e B(X). Prove that B(X) is a Banach algebra. 

2 Let n be a positive integer, let X be the space of all complex n-tuples (normed in any way, as long 
as the axioms for a normed linear space are satisfied), and let B(X) be as in Exercise 1. Prove that the 
spectrum of each member of B(X) consists of at most n complex numbers. What are they? 

3 Take X = L 2 (— oo, oo), suppose (p e L°°(— oo, oo), and let M be the multiplication operator which 
takes / e L 2 to (pf Show that M is a bounded linear operator on L 2 and that the spectrum of M is 
equal to the essential range of (p (Chap. 3, Exercise 19). 

4 What is the spectrum of the shift operator on *f 2 ? (See Sec. 17.20 for the definition.) 

5 Prove that the closure of an ideal in a Banach algebra is an ideal. 

6 If X is a compact Hausdorff space, find all maximal ideals in C(X). 

7 Suppose A is a commutative Banach algebra with unit, which is generated by a single element x. 
This means that the polynomials in x are dense in A. Prove that the complement of <r(x) is a con- 
nected subset of the plane. Hint : If X $ c(x), there are polynomials P n such that P„(x)— ► (x — Xe)~ l in 
A. Prove that P„(z)— ► (z — X )~ 1 uniformly for z e o(x). 

8 Suppose ]Tq |c„| < oo, /(z) = YjS c„z", I /Ml > 0 for every z e U, and l//(z) = Y,o a n z "- Prove 
that \a„ \ < oo. 

9 Prove that a closed linear subspace of the Banach algebra 1}{R 1 ) (see Sec. 9.19) is translation 
invariant if and only if it is an ideal. 

10 Show that l}(T) is a commutative Banach algebra (without unit) if multiplication is defined by 

(/ « 9\t) = 2- j* fit - s)g(s ) ds. 

Find all complex homomorphisms of 1}(T), as in Theorem 9.23. If £ is a set of integers and if I E is the 
set of all / e I}{T) such that f{n) — 0 for all n e £, prove that I E is a closed ideal in I}{T), and prove 
that every closed ideal in I}{T) is obtained in this manner. 

1 1 The resolvent R(X, x) of an element x in a Banach algebra with unit is defined as 

R(X, x) = (Ae — x) _ 1 

for all complex X for which this inverse exists. Prove the identity 

R{X, x) — R{fi, x) = {ji — X)R(X)R(ji) 
and use it to give an alternative proof of Theorem 18.5. 

12 Let A be a commutative Banach algebra with unit. The radical of A is defined to be the intersec- 
tion of all maximal ideals of A. Prove that the following three statements about an element x e A are 
equivalent : 

(u) x is in the radical of A. 

(b) lim ||x"|| 1/n = 0. 

oo 

(c) h(x) = 0 for every complex homomorphism of A. 

13 Find an element x in a Banach algebra A (for instance, a bounded linear operator on a Hilbert 
space) such that x" ^ 0 for all n > 0, but lim„_* 0 ||x"|| 1/n = 0. 
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14 Suppose A is a commutative Banach algebra with unit, and let A be the set of all complex homo- 
morphisms of A, as in Sec. 18.16. Associate with each x e A a function x on A by the formula 

x{h) — h(x) ( h e A). 


x is called the Gelfand transform of x. 

Prove that the mapping x — ► x is a homomorphism of A onto an algebra A of complex functions 
on A, with pointwise multiplication. Under what condition on A is this homomorphism an iso- 
morphism? (See Exercise 12.) 

Prove that the spectral radius p(x) is equal to 

ll*IL = sup h s A}. 

Prove that the range of the function x is exactly the spectrum a(x). 

15 If A is a commutative Banach algebra without unit, let A ! be the algebra of all ordered pairs (x, A), 
with x e A and A a complex number; addition and multiplication are defined in the “obvious” way, 
and ||(x, A) || = ||x|| + |A|. Prove that A t is a commutative Banach algebra with unit and that the 
mapping x— ► (x, 0) is an isometric isomorphism of A onto a maximal ideal of A v This is a standard 
embedding of an algebra without unit in one with unit. 

16 Show that H 00 is a commutative Banach algebra with unit, relative to the supremum norm and 
pointwise addition and multiplication. The mapping /—►/(a) is a complex homomorphism of if 00 , 
whenever | a| < 1. Prove that there must be others. 

17 Show that the set of all functions (z — 1 ) 2 /, where / e H 00 , is an ideal in if 00 which is not closed. 
Hint : 


|(1 -z) 2 (l + <• -z)" 1 — (1 — z) I < € if |z| < l,e>0. 

18 Suppose (p is an inner function. Prove that {(pf: f e H 00 } is a closed ideal in H 00 . In other words, 
prove that if {/„} is a sequence in H 00 such that (pf n -+ g uniformly in U, then g/(p e H°°. 



CHAPTER 

NINETEEN 

HOLOMORPHIC FOURIER TRANSFORMS 


Introduction 

19.1 In Chap. 9 the Fourier transform of a function f on R l was defined to be a 
function / on JR 1 . Frequently / can be extended to a function which is holo- 
morphic in a certain region of the plane. For instance, if f(t) = e~^\ then f(x) = 
(1 + x 2 ) -1 , a rational function. This should not be too surprising. For each real t, 
the kernel e itz is an entire function of z, so one should expect that there are 
conditions on / under which / will be holomorphic in certain regions. 

We shall describe two classes of holomorphic functions which arise in this 
manner. 

For the first one, let F be any function in L 2 (— oo, oo) which vanishes on 
( — oo, 0) [i.e., take F e L 2 ( 0, oo)] and define 


/(*) = 



F(t)e itz dt 


(zeU + l 


( 1 ) 


where n + is the set of all z = x + iy with y > 0. If zell + , then \e itz \ = e~ ty 9 
which shows that the integral in (1) exists as a Lebesgue integral. 

If Im z > S > 0, Im z n > <5, and z„— ► z, the dominated convergence theorem 
shows that 


*00 

lim | exp ( itz n ) — exp ( itz ) | 2 dt = 0 

n-* oo JO 

because the integrand is bounded by the L 1 -function 4 exp ( — 2 St) and tends to 0 
for every t > 0. The Schwarz inequality implies therefore that / is continuous in 
n + . The theorems of Fubini and Cauchy show that /(z) dz = 0 for every 
closed path y in n + . By Morera’s theorem,/ e H(U + ). 
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Let us rewrite (1) in the form 

/(x 4- iy) = J F(t)e~ ty e ltx dt, (2) 

regard y as fixed, and apply Plancherel’s theorem. We obtain 

L f°° |/(x + iy)| 2 dx = f"|F(t)l 2 e _2 " dt < f°°|F(t)| 2 dt (3) 

* 71 J- oo JO Jo 

for every y > 0. [Note that our notation now differs from that in Chap. 9. There 
the underlying measure was Lebesgue measure divided by -Jin. Here we just use 
Lebesgue measure. This accounts for the factor 1/(2 n) in (3).] This shows: 

(a) Iff is of the form (1), then f is holomorphic in II + and its restrictions to horizon- 
tal lines in II + form a bounded set in L 2 ( — oo, oo). 

Our second class consists of all / of the form 

m = f/w ' 2 dt ( 4 ) 

where 0 < A < oo and F e L 2 ( — A, A). These functions / are entire (the proof is 
the same as above), and they satisfy a growth condition: 

I m | < \ A | m | e-*' dt < e A M [ A I Fit) \ dt. (5) 

J-A J-A 

If C is this last integral, then C < oo, and (5) implies that 

\f(z)\<Ce A '*'. (6) 

[Entire functions which satisfy (6) are said to be of exponential type.' ] Thus: 

(b) Every f of the form (4) is an entire function which satisfies (6) and whose 
restriction to the real axis lies in l3 (by the Plancherel theorem). 

It is a remarkable fact that the converses of (a) and {b) are true. This is the 
content of Theorems 19.2 and 19.3. 

Two Theorems of Paley and Wiener 
19.2 Theorem Suppose f e H(Tl + ) and 

sup f f \ f(x + iy)\ 2 dx = C < CO. (1) 

0 <y < oo J - oo 

Then there exists an F e L 2 ( 0, oo) such that 

f(z) = ^F(t)e i,z dt (z e IT + ) 


( 2 ) 
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and 


JjFWI 2 dt = C. (3) 

Note : The function F we are looking for is to have the property that f(x + iy) is 
the Fourier transform of F(t)e~ yt (we regard y as a positive constant). Let us 
apply the inversion formula (whether or not this is correct does not matter; we 
are trying to motivate the proof that follows): The desired F should be of the 
form 


m = ** • L J°° f(x + iy)e~ i,x dx = ± dz. (4) 

The last integral is over a horizontal line in n + , and if this argument is correct at 
all, the integral will not depend on the particular line we happen to choose. This 
suggests that the Cauchy theorem should be invoked. 


Proof Fix y, 0 < y < oo. For each a > 0 let T a be the rectangular path with 
vertices at + a + i and ± a + iy. By Cauchy’s theorem 


f(z)e~ it2 dz = 0. (5) 

Jr* 

We consider only real values of t. Let <!>(/?) be the integral oif(z)e~ itz over 
the straight line interval from /? + i to ft + iy (fl real). Put I = [y, 1] if y < 1, 
J = y] if 1 < y* Then 


W)l 2 = 


f(P + iu)e lt ^ +iu) du < \ \f(P + it/) | 2 du f e 2tu du. (6) 

Ji Ji Ji 


Put 


-i 


m= \ \fw + iu )\ 2 du. 


Then (1) shows, by Fubini’s theorem, that 


_L f 00 

J-o 


A (/?) dp < Cm(I). 


Hence there is a sequence {a,-} such that a oo and 

A(a y ) + A(-aj)-> 0 0’ <»)• 

By (6), this implies that 

d^ay)— »0, d>( — a^— > 0 as /— ► co. 


( 7 ) 

( 8 ) 

( 9 ) 


( 10 ) 
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Note that this holds for every t and that the sequence {a 7 } does not depend 
on t. 

Let us define 


0/;M) = ^j /(* + iy)e itx dx. (11) 

Then we deduce from (5) and (10) that 

lim | \e ,y gj(y, t) - efgfl, t)] = 0 (- oo < t < oo). (12) 

j~* °0 

Write f y (x) for f(x -f iy). Then f y e L 2 (— oo, oo), by hypothesis, and the 
Plancherel theorem asserts that 


lim \f y (t) — g/y, t) | 2 dt = 0, (13) 

j~* oo J— OO 

where f y is the Fourier transform oif y . A subsequence of {g/y, t)} converges 
therefore pointwise to f y (t) 9 for almost all t (Theorem 3.12). If we define 

m = e'hit\ (14) 


it now follows from (12) that 

m = ^//o. (i5) 

Note that (14) does not involve y and that (15) holds for every y e (0, oo). 
Plancherel’s theorem can be applied to (15): 

J ” 0 e~ 2,y \ m | 2 dt = I f y (t) | 2 dt = ±^\ fy(x) | 2 dx < C. (16) 

If we let y — ► oo, (16) shows that F(t) = 0 a.e. in ( — oo, 0). 

If we let y— ► 0, (16) shows that 

^\F(t )\ 2 dt<C. (17) 

It now follows from (15) that f y e L 1 if y > 0. Hence Theorem 9.14 gives 

fy(x)= (* fy(t)e i,x dt (18) 

J — OO 

or 

f(z)= \ F(t)e- y, e i,x dt = \F(t)e i,z dt (z e n + ). (19) 

Jo Jo 

This is (2), and now (3) follows from (17) and formula 19.?; 3). 


//// 
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19.3 Theorem Suppose A and C are positive constants and f is an entire func- 
tion such that 


|/(z)| < Ce A ' z ' 


for all z, and 


r 


|/(x)| 2 dx < 00 . 


Then there exists an F e L 2 ( — A, A) such that 


-j: 


f(z) = | F(t)e‘<> dt 

-A 


(l) 


( 2 ) 


( 3 ) 


for all z. 


Proof Put f(x) = f(x)e £|x| , for e > 0 and x real. We shall show that 

lim j f(x)e~ itx dx = 0 (t real, 1 1 1 > A). (4) 

£-+0 J — 00 

Since \\f— /|| 2 — 3 ►O as e— ►(), the Plancherel theorem implies that the 
Fourier transforms of f e converge in L 2 to the Fourier transform F of f (more 
precisely, of the restriction of / to the real axis). Hence (4) will imply that F 
vanishes outside [ — A , A\ and then Theorem 9.14 shows that (3) holds for 
almost every real z. Since each side of (3) is an entire function, it follows that 
(3) holds for every complex z. 

Thus (4) implies the theorem. 

For each real a, let T a be the path defined by 

r a (s) = se ia (0 < s < oo), (5) 


put 

n a = (w: Re (we ia ) > A }, (6) 


and if w e n a , define 

<D a (w) = j* f(z)e- wz dz = e ix j f(se ix ) exp (~wse ix ) ds. (7) 

By (1) and (5), the absolute value of the integrand is at most 
C exp { — [Re ( we ia ) — 4]s}, 

and it follows (as in Sec. 19.1) that O a is holomorphic in the half plane II a . 
However, more is true if a = 0 and if a = n: We have 

O 0 (w) = J f(x)e~ wx dx (Re w > 0), 


( 8 ) 
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<D n (w) = - f f(x)e~ wx dx (Re w < 0). (9) 

J— 00 

O 0 and are holomorphic in the indicated half planes because of (2). 

The significance of the functions O a to (4) lies in the easily verified rela- 
tion 


*00 

f £ (x)e~ itx dx = O 0 (e + it) — 0^ — 6 + it) (t real). (10) 

J- 00 

Hence we have to prove that the right side of (10) tends to 0 as e— ► 0, if t > A 
and if t < — A. 

We shall do this by showing that any two of our functions O a agree in 
the intersection of their domains of definition, i.e., that they are analytic con- 
tinuations of each other. Once this is done, we can replace <D 0 and <!> n by <S> n/2 
in (10) if t < — A, and by 0>- n/2 if t > A, and it is then obvious that the 
difference tends to 0 as e— ► 0. 

So suppose 0 < P — a < n. Put 


7 = 


a + jg 
2 


rj — cos 


P-* 

2 


>0. 


( 11 ) 


If w = | w | e iv , then 

Re (we ix ) = r\ | w | = Re {we ifi ) ( 1 2) 

so that w e n a n n„ as soon as | w | > A/rj. Consider the integral 

jV(z)*-"” dz (13) 

over the circular arc T given by T(t) = re lt , (x<t<p. Since 

Re ( — wz) = — | w | r cos (t — y) < — | w | rrj, (14) 

the absolute value of the integrand in (13) does not exceed 

C exp {(A - | w| i/)r}. 

If | w | > A/rj it follows that (13) tends to 0 as r— ► oo. 

We now apply the Cauchy theorem. The integral of f(z)e~ wz over the 

interval [0, re ip ~\ is equal to the sum of (13) and the integral over [0, re i<x ~\. 

Since (13) tends to 0 as r— ► oo, we conclude that O a (w) = O^w) if w = | w | e~ iy 
and |w| > A/rj, and then Theorem 10.18 shows that O a and coincide in 
the intersection of the half planes in which they were originally defined. 

This completes the proof. //// 

19.4 Remarks Each of the two preceding proofs depended on a typical appli- 
cation of Cauchy’s theorem. In Theorem 19.2 we replaced integration over 
one horizontal line by integration over another to show that 19.2(15) was 
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independent of y. In Theorem 19.3, replacement of one ray by another was 
used to construct analytic continuations; the result actually was that the 
functions O a are restrictions of one function O which is holomorphic in the 
complement of the interval [ — Ai 9 Ai\. 

The class of functions described in Theorem 19.2 is the half plane ana- 
logue of the class H 2 discussed in Chap. 17. Theorem 19.3 will be used in the 
proof of the Denjoy-Carleman theorem (Theorem 19.11). 


Quasi-analytic Classes 

19.5 If Q is a region and if z 0 e Q, every / e H( Q) is uniquely determined by the 
numbers / (z 0 ), f\z Q \ f"(z 0 ), On the other hand, there exist infinitely differen- 
tiable functions on JR 1 which are not identically 0 but which vanish on some 
interval. Thus we have here a uniqueness property which holomorphic functions 
possess but which does not hold in C 00 (the class of all infinitely differentiable 
complex functions on R 1 ). 

If f e H(Q) 9 the growth of the sequence { | f {n \z 0 ) | } is restricted by Theorem 
10.26. It is therefore reasonable to ask whether the above uniqueness property 
holds in suitable subclasses of C 00 in which the growth of the derivatives is 
subject to some restrictions. This motivates the following definitions; the answer 
to our question is given by Theorem 19.11. 


19.6 The Classes C{M n } If Af 0 , M l9 M 2 , ... are positive numbers, we let C{M n } 
be the class of all /eC 00 which satisfy inequalities of the form 


\m\\„<P f B* f M n (n = 0, 1,2,...). (1) 

Here D°f = f D n f is the nth derivative of / if n > 1, the norm is the supremum 
norm over R 1 , and and B f are positive constants (depending on /, but not on 
n). 

If / satisfies (1), then 


lim sup 

n~> oo 



(2) 


This shows that B f is a more significant quantity than /? f . However, if were 
omitted in (1), the case n = 0 would imply H/H*, <M 0 , an undesirable 
restriction. The inclusion of P f makes C{M n } into a vector space. 

Each C{M n } is invariant under affine transformations. More explicitly, suppose 
/ e C{M n } and g(x) =f(ax - b b). Then g satisfies (1), with = P f and B g — aB f . 

We shall make two standing assumptions on the sequences {M„} under con- 
sideration : 


M 0 = 1. 

Ml < (n = 1, 2, 3, ...)• 

Assumption (4) can be expressed in the form: {log M„} is a convex sequence. 


( 3 ) 

( 4 ) 
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These assumptions will simplify some of our work, and they involve no loss 
of generality. [One can prove, although we shall not do so, that every class 
C{M n } is equal to a class C{M„}, where {M„} satisfies (3) and (4).] 

The following result illustrates the utility of (3) and (4): 

19.7 Theorem Each C{M n } is an algebra , with respect to pointwise multiplica- 
tion. 

Proof Suppose / and g e C{M n } 9 and f$ f9 B f9 P g9 and B g are the correspond- 
ing constants. The product rule for differentiation shows that 

D"(fg)= £ ( n \D j f) ■ (D"- J g). (1) 

j= 0 \J / 


Hence 


\D n (fg)\ < P f p 9 £ ( (2) 
j = 0 \J / 

The convexity of {log M„}, combined with M 0 = 1, shows that MjM n _j < 
M n for 0 < j < n. Hence the binomial theorem leads from (2) to 

mfg)\\« < PfU B f + B 9 TM h (n = 0, 1, 2, ...), (3) 

so that fg e //// 

19.8 Definition A class C{M n } is said to be quasi-analytic if the conditions 

/ g C{M„}, (D n f)( 0) = 0 (for n = 0, 1, 2, . . .) (1) 

imply that f(x) = 0 for all x e R 1 . 

The content of the definition is of course unchanged if (D n f)( 0) is replaced 
by (D n f)(x 0 ) 9 where x 0 is any given point. 

The quasi-analytic classes are thus the ones which have the uniqueness 
property we mentioned in Sec. 19.6. One of these classes is very intimately 
related to holomorphic functions : 

19.9 Theorem The class C{n\} consists of all f to which there corresponds a 
S > 0 such that f can be extended to a bounded holomorphic function in the 
strip defined by | Im(z) | < S . 

Consequently C{n\} is a quasi-analytic class . 

Proof Suppose / e H(Q) and \f(z) \ < for all zeQ, where Q consists of all 
z = x + iy with \ y\ <3. It follows from Theorem 10.26 that 


\W)(x)\ <pd~ n n\ (n = 0, 1,2,...) (1) 

for all real x. The restriction of / to the real axis therefore belongs to C{n\}. 
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Conversely, suppose /is defined on the real axis and / e C{n\}. In other 
words, 


imWn^pB-nl (n = 0,1,2,...). (2) 

We claim that the representation 

f(x) = 

is valid for all a e R 1 if a — B 
formula 


V WfXa), v, 


n = 0 

- 1 


(3) 


< x < a + B \ This follows from Taylor’s 


" _1 (D j f\(a\ 1 C x 

m = i (x - + - — - (x - ty-'mm n, (4) 

j=0 j'- 

which one obtains by repeated integrations by part. By (2) the last term in (4) 
(the “ remainder ”) is dominated by 


npB n 


Ja 


(x - t) 


n- 1 


dt 


= P I B(x — a) |". 


(5) 


If | B(x — a) | < 1, this tends to 0 as n — > oo, and (3) follows. 

We can now replace x in (3) by any complex number z such that 
| z — a | < l/B. This defines a holomorphic function F a in the disc with center 
at a and radius l/B, and F a (x) = f{x) if x is real and \x — a\ < l/B. The 
various functions F a are therefore analytic continuations of each other; they 
form a holomorphic extension F of/ in the strip \ y\ < l/B. 

If 0 < S < l/B and z = a + iy 9 \ y \ < <5, then 


I F(z) | = | F a (z) | = 


f 

» = o n! 


< Z (B<5)" = 

n = 0 


P 

1 -B5 


This shows that F is bounded in the strip \y \ < <5, and the proof is complete. 

//// 


19.10 Theorem The class C{M n } is quasi-analytic if and only if C{M n } con- 
tains no nontrivial function with compact support . 

Proof If C{M n } is quasi-analytic, iff e C{M„}, and if / has compact support, 
then evidently / and all its derivatives vanish at some point, hence /(x) = 0 
for all x. 

Suppose C{M n } is not quasi-analytic. Then there exists an / e C{M n } 
such that (D"/)( 0) = 0 for n = 0, 1, 2, ... , but /(x 0 ) ^ 0 for some x 0 . We may 
assume x 0 > 0. If g(x) = / (x) for x > 0 and g(x) = 0 for x < 0, then g e 
C{M n }. Put h(x) = 0(x)0(2x o — x). By Theorem 19.7, h e C{M n }. Also, 
h(x) = 0 if x < 0 and if x > 2x 0 . But h(x 0 ) = / 2 (x 0 ) ^ 0. Thus h is a non- 
trivial member of C{M n } with compact support. //// 
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We are now ready for the fundamental theorem about quasi-analytic clases. 


The Denjoy-Carleman Theorem 


19.11 Theorem Suppose M 0 = 1, Ml < M n _ l M n + l for n = 1, 2, 3, and 


n = 0 M n 


q(x) = sup— , 

.2:0 M n 


for x > 0. Then each of the following five conditions implies the other four : 
(a) C{M„} is not quasi-analytic. 


(b) 

(c) 


r 

r 


log Q(X ) 


log q(x) 


dx 


1 + x^ 


< 00 . 


dx 


1 -f- x 4 


< 00 . 


1 \ 1/w 

w I M < »■ 


n= 1 


oo 


(<?) X 

n= 1 


M„-l 


< 


00 . 


Note: If M n — > oo very rapidly as n— ► oo, then Q(x) tends to infinity slowly as 
x— ► oo. Thus each of the five conditions says, in its own way, that M„— ► oo 
rapidly. Note also that Q(x) > 1 and q(x) > 1. The integrals in ( b ) and (c) are thus 
always defined. It may happen that Q[x) = oo for some x < oo. In that case, the 
integral ( b ) is + oo, and the theorem asserts that C{M n } is quasi-analytic. 

If M n = n\ , then M n _JM n = l/n, hence (e) is violated, and the theorem 
asserts that C{n\} is quasi-analytic, in accordance with Theorem 19.9. 


Proof that (a) implies ( b ) Assume that C{M n } is not quasi-analytic. Then 
C{M n } contains a nontrivial function with compact support (Theorem 19.10). 
An affine change of variable gives a function F e C{M n }, with support in 
some interval [0, A], such that 

\\D n F\\ a0 < 2~ n M n (n = 0, 1,2,...) (1) 

and such that F is not identically zero. Define 

/(z) = ^F(ty" dt (2) 

and 


g(w) =/ 



( 3 ) 
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Then / is entire. If Im z > 0, the absolute value of the integrand in (2) is 
at most \F(t)\. Hence / is bounded in the upper half plane; therefore g is 
bounded in U. Also, g is continuous on U, except at the point w = — 1. Since 
/ is not identically 0 (by the uniqueness theorem for Fourier transforms) the 
same is true of g , and now Theorem 15.19 shows that 


2 n 



(4) 


If x = i( 1 — e t6 )/(\ + e ld ) = tan (0/2), then dO = 2(1 + x 2 ) 1 dx, so (4) is the 
same as 


if 


log |/(x)| 


dx 


• > — 00 . 


1 + x 2 

On the other hand, partial integration of (2) gives 


f(z) = W 


{D n i 


F\t)e i,z dt (z * 0) 


(5) 


( 6 ) 


since F and all its derivatives vanish at 0 and at A. It now follows from (1) 
and (6) that 


| x n f(x) | < 2 n AM n (x real, n = 0, 1, 2, . . .). 


(7) 


Hence 


00 x n I f(x) I 

Q(x) I f(x) | = X J <2A (x > 0), (8) 

and (5) and (8) imply that (b) holds. //// 

Proof that (b) implies (c) q(x) < Q(x). //// 

Proof that (c) implies (d) Put a n = M \ >n . Since M 0 = 1 and since M 2 < 
M n _ 1 M n+1 , it is easily verified that a n < a n+l , for n > 0. If x > ea n , then 
x n /M n > e n , so 


log q(x) > log 


M n 


> log e n = n. 


(9) 


Hence 


e 


f 


log ^(x) 


dx 


> e 



*00 

dx + e (N H- l)x -2 dx 

JeaN + 1 


Z n(~ -) + ^ = N i 1 - 

n= 1 \ a n a n+lJ a N + 1 n= 1 a n 


(10) 


for every N. This shows that (c) implies (d). 


Illl 
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Proof that (< d ) implies ( e ) Put 




M„-i 

M n 


(n= 1, 2, 3,...). 


( 11 ) 


Then > X 2 ^ ^3 ^ “ * , and if a n = M* /n , as above, we have 

{a n X n ) n <M u - X n = 1 . (12) 

Thus < l/a„ , and the convergence of £(1 /a n ) implies that of . //// 


Proof that (e) implies (a) The assumption now is that < 00 , where X n is 
given by (11). We claim that the function 


/(*) = 



sin X n z 

K* 


(13) 


is an entire function of exponential type, not identically zero, which satisfies 
the inequalities 

|x*/(x)| < M k(^^ 

Note first that 1 — z" 1 sin z has a zero at the origin. Hence there is a 
constant B such that 


(x real, k = 0, 1, 2, . . .). 


(14) 


<B\z\ (|z| < 1). (15) 

<Bk n \z\ (|z|^£), (16) 

so that the series 


It follows that 


sin X n 


00 


z 


sin X n z 

X n z 


(17) 


converges uniformly on compact sets. (Note that 1/A„— ► 00 as n— ► 00 , since 
< 00 .) The infinite product (13) therefore defines an entire function / 
which is not identically zero. 

Next, the identity 

’rMJr* 

shows that | z~ 1 sin z | ^ e w if z = x + iy. Hence 


| /(z) | < e Mz \ with A = 2+ £ X n . 

n= 1 


( 19 ) 



HOLOMORPHIC FOURIER TRANSFORMS 383 


For real x , we have | sin x \ < \ x | and | sin x | < 1. Hence 


\x k f(x) \ <\x k \ 


sin x 


sin x 
x 
2 


2 k 

n 

> 1=1 


sin ?.„x\ 


Lx 


(L ■■■L) 


--M- 


This gives (14), and if we integrate (14) we obtain 

\x k f(x)\dx^M k (fc = 0,1,2,...). 


if 


( 20 ) 


( 21 ) 


We have proved that / satisfies the hypotheses of Theorem 19.3. The 
Fourier transform of f. 


F(t) = ^- ( f(x)e~‘ ,x dx (t real) (22) 

2n J - 00 

is therefore a function with compact support, not identically zero, and (21) 
shows that FeC® and that 


(D k F)(t) = ^ j" (~ix) k /(x)e “ X dx, (23) 

by repeated application of Theorem 9.2(f). Hence ||Z>*i r || ^ < M k , by (21), 
which shows that F e C{M n }. 

Hence C{M n } is not quasi-analytic, and the proof is complete. //// 


Exercises 


1 Suppose /is an entire function of exponential type and 


<p(y) = 


|/(x + 1>)| 2 dx. 

J - QO 


Prove that either q>(y) = oo for all real y or <p(y) < oo for all real y. Prove that / = 0 if (p is a bounded 
function. 

2 Suppose / is an entire function of exponential type such that the restriction of / to two nonparallel 
lines belongs to L 2 . Prove that f - 0. 

3 Suppose / is an entire function of exponential type whose restriction to two nonparallel lines is 
bounded. Prove that /is constant. (Apply Exercise 9 of Chap. 12.) 

4 Suppose /is entire, \f(z) \ < C exp {A \ z \ ), and /(z) = £a n z". Put 


<*»0=zS- 


Prove that the series converges if | w | > A, that 
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if T(t) = (A + e)e lt , 0 < t < 2n, and that O is the function which occurred in the proof of Theorem 
19.3. (See also Sec. 19.4.) 

5 Suppose / satisfies the hypothesis of Theorem 19.2. Prove that the Cauchy formula 


/M 


1 f°° 


/« + te) 

€ + ie — z 


dt; (0 < € < y) 


(*) 


holds; here z = x + iy. Prove that 


/*(*) = lim/(x + iy) 

y~*o 

exists for almost all x. What is the relation between /* and the function F which occurs in Theorem 
19.2? Is (*) true with e = 0 and with /* in place of/ in the integrand? 

6 Suppose (p e L 2 (— oo, oo) and q> > 0. Prove that there exists an / with \f\ = (p such that the Fourier 
transform of / vanishes on a half line if and only if 


i: 


dx 

log <p(x) — — J : 

, 1 + X^ 


Suggestion : Consider /*, as in Exercise 5, where / = exp (u + it?) and 



(x - t) 2 + y 2 


log cp{t) dt. 


I Let /be a complex function on a closed set E in the plane. Prove that the following two conditions 
on / are equivalent : 

(a) There is an open set Q => E and a function F e H(Q) such that F(z) = / (z) for z e E. 

{b) To each a e E there corresponds a neighborhood V a of a and a function F a e HiV^ such that 
F a (z) =/(z) in V a n E. 

(A special case of this was proved in Theorem 19.9.) 

8 Prove that C{n!} = €{n"}. 

9 Prove that there are quasi-analytic classes which are larger than C{n \ }. 

10 Put X n - as in the proof of Theorem 19.11. Pick g 0 e C/R 1 ), and define 

9n(x) = (2AJ- 1 »..,(* - t) dt (n = 1, 2, 3, . . .). 

Prove directly (without using Fourier transforms or holomorphic functions) that g = lim g n is a func- 
tion which demonstrates that (< e ) implies (a) in Theorem 19.11. (You may choose any g 0 that is conve- 
nient.) 

II Find an explicit formula for a function (p e C 00 , with support in [ — 2, 2], such that (p{x ) = 1 if 
-1 < x < 1. 

12 Prove that to every sequence {a„} of complex numbers there corresponds a function / e C 00 such 
that (D"/X0) = a„ for n = 0, 1, 2, .... Suggestion: If (p is as in Exercise 11, if p n = ccjn\, if g n w- 
p„ x"<p(x), and if 


/„(*) = K "Snik = Pn X "<P(k ^c). 

then || D*/„||„ < 2"” for k = 0, . ... it — 1, provided that A„is large enough. Take f —If 
13 Construct a function / e C x such that the power series 

I — — (X - ar 

n = 0 n • 
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has radius of convergence 0 for every a e R 1 . Suggestion : Put 

/(*) = E c k e iXkX , 

k= 1 

where {c k } and {A k } are sequences of positive numbers, chosen so that Ec k A£ < oo for n = 0, 1, 2, ... 
and so that c„ AJ| increases very rapidly and is much larger than the sum of all the other terms in the 
series Lc k AJ . 

For instance, put c k = A k _k , and choose {A k } so that 

k- 1 

A k > 2 Cj A* and A k > k 2k . 

j= i 

14 Suppose C{M n } is quasi-analytic, / e C{M n }, and f(x) = 0 for infinitely many x e [0, 1]. What 
follows? 

15 Let X be the vector space of all entire functions / that satisfy |/(z) | < Ce* ,zl for some C < oo, and 
whose restriction to the real axis is in L 2 . Associate with each f e X its restriction to the integers. 
Prove that /— ► {/(n)} is a linear one-to-one mapping of X onto £ 2 . 

16 Assume / is a measurable function on (— oo, oo) such that |/(x)| < e~ M for all x. Prove that its 
Fourier transform / cannot have compact support, unless f{x) — 0 a.e. 



CHAPTER 

TWENTY 


UNIFORM APROXIMATION 
BY POLYNOMIALS 


Introduction 

20.1 Let K° be the interior of a compact set K in the complex plane. (By defini- 
tion, K° is the union of all open discs which are subsets of K ; of course, K° may 
be empty even if K is not.) Let P(K) denote the set of all functions on K which 
are uniform limits of polynomials in z. 

Which functions belong to P(K)? 

Two necessary conditions come to mind immediately: If f e P(K), then 
fe C(K) and / e H(K°). 

The question arises whether these necessary conditions are also sufficient. 
The answer is negative whenever K separates the plane (i.e., when the com- 
plement of K is not connected). We saw this in Sec. 13.8. On the other hand, if K 
is an interval on the real axis (in which case K° = 0), the Weierstrass approx- 
imation theorem asserts that 


P(K) = C(K). 

So the answer is positive if K is an interval. Runge’s theorem also points in this 
direction, since it states, for compact sets K which do not separate the plane, that 
P(K) contains at least all those / e C(K) which have holomorphic extensions to 
some open set Q => K. 

In this chapter we shall prove the theorem of Mergelyan which states, 
without any superfluous hypotheses, that the above-mentioned necessary condi- 
tions are also sufficient if K does not separate the plane. 

The principal ingredients of the proof are: Tietze’s extension theorem, a 
smoothing process invoving convolutions, Runge’s theorem, and Lemma 20.2, 
whose proof depends on properties of the class 9* which was introduced in Chap. 
14. 


386 
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Some Lemmas 


20.2 Lemma Suppose D is an open disc of radius r > 0, E a D, E is compact 
and connected , Q = S 2 — E is connected , and the diameter of E is at least r. 
Then there is a function g e H(£l) and a constant b 9 with the following property : 
if 


Q(L z) = g(z) + (C - b)g 2 (z), 


the inequalities 


I G(C, z) | < 


100 


Q(C, z) - 


hold for all z e Cl and for all £ e D. 


l,000r 2 

Iz-CI 3 


(1) 

( 2 ) 

( 3 ) 


We recall that S 2 is the Riemann sphere and that the diameter of E is the 
supremum of the numbers | z 1 — z 2 1, where e E and z 2 e E. 


Proof We assume, without loss of generality, that the center of D is at the 
origin. So D = D( 0; r). 

The implication (d)^>(b) of Theorem 13.11 shows that Q is simply con- 
nected. (Note that oo e Cl.) By the Riemann mapping theorem there is there- 
fore a conformal mapping F of U onto Q such that F(0) = oo. F has an 
expansion of the form 


F(w) = - + £ c n w" (we U). (4) 

W n = Q 

We define 

g(z) = -F-\z) (zed), (5) 

a 

where F ~ 1 is the mapping of Q onto U which inverts F, and we put 

b = f. f zg(z) dz, (6) 

2ni J r 

where T is the positively oriented circle with center 0 and radius r. 

By (4), Theorem 14.15 can be applied to F/a. It asserts that the diameter 
of the complement of ( F/a)(U ) is at most 4. Therefore diam E <4\a\. Since 
diam E > r, it follows that 



(7) 
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Since g is a conformal mapping of Q onto D( 0; 1/ 1 a | ), (7) shows that 

\g(z)\<- r (zeCl) (8) 

and since T is a path in Q, of length 2t ir, (6) gives 

|h|<4r. (9) 

If C e D 9 then | £ | < r, so (1), (8), and (9) imply 

iei^ + 5^)<^. do) 

This proves (2). 

Fix CeD. 

If z = F(w), then zg(z) = wF(w)/a; and since wF(w)— > a as w-> 0, we have 
zg(z)— > 1 as z— > oo. Hence g has an expansion of the form 

i") 

Let T 0 be a large circle with center at 0; (11) gives (by Cauchy’s theorem) that 

* 2 (0 = L [ (z ~ 09(0 dz = b-0 (12) 

Substitute this value of A 2 (0 into (11). Then (1) shows that the function 

<p(0 = ^(C, z) - ~-^(z - 0 3 (13) 

is bounded as z— > oo. Hence <p has a removable singularity at oo. If 
z e Cl n D, then | z — Cl < 2r, so (2) and (13) give 

| q>{z) | < 8r 3 1 Q(C, z) | + 4 r 2 < l,000r 2 . (14) 

By the maximum modulus theorem, (14) holds for all z e Cl. This proves (3). 

//// 


20.3 Lemma Suppose f e C' c (R 2 ), the space of all continuously differentiable 
functions in the plane, with compact support. Put 


= 1 (d_ .d\ 

2 + 1 dyf 


( 1 ) 


Then the following “Cauchy formula ” holds: 

mo 


no 


= _i rr ® 

71 J JR2 c — 


dC dr, (C = c + if])- 


( 2 ) 
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Proof This may be deduced from Green’s theorem. However, here is a 
simple direct proof: 

Put <p(r, 0) =f(z + re ie ) 9 r > 0, 0 real. If £ = z + re i0 9 the chain rule gives 




The right side of (2) is therefore equal to the limit, as e— ► 0, of 

* 2n / dip i dip ^ 


_L r°° r 2 
2n 1 Jo 


~ ^ I ^ ^ r * 

dr r dd j 


( 3 ) 


( 4 ) 


For each r > 0, ip is periodic in 0, with period 271. The integral of d<p/00 is 
therefore 0, and (4) becomes 


* r°° dip i c 2n 

d9 [ i dr = Tn \ 0 ^ 6)dd - 

As e— ► 0, <p(e, 0)— >/(z) uniformly. This gives (2). 


_1_ 

2n 


(5) 

III / 


We shall establish Tietze’s extension theorem in the same setting in which we 
proved Urysohn’s lemma, since it is a fairly direct consequence of that lemma. 

20.4 Tietze’s Extension Theorem Suppose K is a compact subset of a locally 
compact Hausdorjf space X 9 and f e C(K). Then there exists an F e C c (X) such 
that F(x ) = f(x) for all x e K. 


(As in Lusin’s theorem, we can also arrange it so that \\F\\ X = \\f\\ K .) 


Proof Assume / is real, —1 </< 1. Let W be an open set with compact 
closure so that K a W. Put 

K + ={xe K:f(x) > *}, K~ = {x e K:f(x) < -±}. (1) 

Then K + and K~ are disjoint compact subsets of W. As a consequence of 
Urysohn’s lemma there is a function / x e C c (X) such that ffx) = j on K + , 
/ x (x) = — j on K~ 9 — j < ffx) < j for all x e X 9 and the support of / x lies in 
W. Thus 

\f-h\<i on K, I /i I < i on AT. (2) 

Repeat this construction with f —f x in place of / : There exists an f 2 e 
C c (X) 9 with support in W 9 so that 

l/-/i-/2l<(!) 2 onK, I/2 1 ^ 3 • § on X. ( 3 ) 

In this way we obtain functions /„ e C c (X) 9 with supports in W 9 such that 

I /-/1 /„ I < (!)" on K, | /J < i • (fr 1 on X (4) 
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Put F = / x + f 2 + / 3 + * * * . By (4), the series converges to / on K , and it 
converges uniformly on X. Hence F is continuous. Also, the support of F lies 
in W. HU 

Mergelyan’s Theorem 

20.5 Theorem If K is a compact set in the plane whose complement is con- 
nected , if f is a continuous complex function on K which is holomorphic in the 
interior of K , and if e > 0, then there exists a polynomial P such that 
| f(z) - P(z) | < efor all z e K. 


If the interior of K is empty, then part of the hypothesis is vacuously satis- 
fied, and the conclusion holds for every / e C(K). Note that K need not be con- 
nected. 

Proof By Tietze’s theorem, / can be extended to a continuous function in 
the plane, with compact support. We fix one such extension, and denote it 
again by /. 

For any 6 > 0, let co(<5) be the supremum of the numbers 

l/(*2)-/(*l)l 

where z x and z 2 are subject to the condition \z 2 — z t | < <5. Since / is uni- 
formly continuous, we have 


lim co(<5) = 0. (1) 

< 5-+0 

From now on, S will be fixed. We shall prove that there is a polynomial 
P such that 


| f(z) - P(z) I < 10,000co(<5) (z e K). (2) 

By (1), this proves the theorem. 

Our first objective is the construction of a function O e C' c {R 2 ), such that 
for all z 


\Az)-m\<co(s), 

( 3 ) 


( 4 ) 


4>(2)= ~n\\^7 didri iC = ^ + ^ 
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where X is the set of all points in the support of O whose distance from the 
complement of K does not exceed 5. (Thus X contains no point which is “ far 
within ” K .) 

We construct O as the convolution of / with a smoothing function A. Put 
a{r) = 0 if r > <5, put 



(0 < r < <5), 


( 6 ) 


and define 


A(z) = a(\z\) 


for all complex z. It is clear that A e C' c (R 2 ). We claim that 


II 


A 


= 1 , 



= 0, 

24 2 

" 15(5 < <5 


( 7 ) 


( 8 ) 

( 9 ) 

( 10 ) 


The constants are so adjusted in (6) that (8) holds. (Compute the integral 
in polar coordinates.) (9) holds simply because A has compact support. To 
compute (10), express dA in polar coordinates, as in the proof of Lemma 20.3, 
and note that dA/dO = 0, | dA/dr | = — a'(r). 

Now define 


O(z) 


-II 

R2 


f(z - QA( C) d£ dr, 


-H* 2 - 

R2 


0/(0 di dr,. 


( 11 ) 


Since / and A have compact support, so does O. Since 


«*z) -m = 


[/(2 - 0 -mV 1(0 dt dr, 


( 12 ) 


R2 


and A(Q = 0 if | Cl > <5, (3) follows from (8). The difference quotients of A 
converge boundedly to the corresponding partial derivatives of A, since 
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A e C'(R 2 ). Hence the last expression in (11) may be differentiated under the 
integral sign, and we obtain 


(3®X*) = 


II 


(dA\z - 0/(0 dt dr, 


= 11 
R2 


f(z - CX^XO dZ dr, 


= || n/(z - o -/mm o dn d n . 

R2 


(13) 


The last equality depends on (9). Now (10) and (13) give (4). If we write 
(13) with <t> x and <f> y in place of dO, we see that O has continuous partial 
derivatives. Hence Lemma 20.3 applies to O, and (5) will follow if we can 
show that dO = 0 in G, where G is the set of all z e K whose distance from 
the complement of K exceeds 3 . We shall do this by showing that 

0(z)=/(z) (z e G); (14) 

note that df = 0 in G, since / is holomorphic there. (We recall that d is the 
Cauchy-Riemann operator defined in Sec. 11.1.) Now if z e G, then z — C is in 
the interior of K for all f with | f | <3. The mean value property for harmo- 
nic functions therefore gives, by the first equation in (11), 


m s 

0(z) = a(r)r dr 

Jo 


f f(z - re ie ) do 


Jo 


= 2nf(z) a(r)r dr =f(z) 
Jo 



A =f(z) 


(15) 


for all z e G. 

We have now proved (3), (4), and (5). 

The definition of X shows that X is compact and that X can be covered 
by finitely many open discs D u . . . , D n , of radius 23 , whose centers are not in 
K. Since S 2 — K is connected, the center of each Dj can be joined to oo by a 
polygonal path in S 2 — K. It follows that each Dj contains a compact con- 
nected set Ej , of diameter at least 23 , so that S 2 — E } is connected and so 
that K n Ej = 0. 

We now apply Lemma 20.2, with r = 23. There exist functions 
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Qj e H(S 2 — Ej) and constants bj so that the inequalities 



ie/c,z)i 

i 50 

!< 7’ 

(16) 


, i 

4,000(5 2 



e/c, 2 - , 

r i3 

(17) 


z-C 

1 z-CI 


hold for z ^ Ej and ( e Dj, if 



6/C, *) = flf/z) + (c - *>;)<#)• 

(18) 

Let Q be the complement of u • • • 

u £„ . Then Q is an 

open set which 

contains K. 




Put Xi=X n D 

, and X, = (Xn D.) - (Jf x u • • • 

uX;-,), for 

2 < j < n. Define 




r(c, z) = e/c, z) 

(C 6 X,, Z 6 fi) 

(19) 


and 


m 




{mom, z) di dr, (z e £}). 


( 20 ) 


Since 


" 1 

F(z)= Z 1 


i= i 


(3®X0 S/C, z) dr,. 


Xj 


( 21 ) 


(18) shows that F is a finite linear combination of the functions and gj. 
Hence F e H(Q). 

By (20), (4), and (5) we have 


I F(z) — 0(z) | < 


2co(d) 

ltd 


I 


R(C, z) - 


1 


Z-t 


di drj (z e Q). (22) 


Observe that the inequalities (16) and (17) are valid with jR in place of Qj if 
C e X and zefi. For if £ e X then ( e X .• for some /, and then R(C, z) = 
Q/C, z) for all zeQ. 

Now fixzefi, put f = z + and estimate the integrand in (22) by (16) 
if p < 45, by (17) if 45 < p. The integral in (22) is then seen to be less than the 
sum of 


2 *I 


(23) 
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and 


J *o° 4 qq(L 52 

* ^ p dp = 2,000nd. (24) 

4(5 P 

Hence (22) yields 

| F(z) - O(z) | < 6,OOOco(<5) (z e O). (25) 

Since F e H( Q), XcQ, and S 2 — K is connected, Runge’s theorem 
shows that F can be uniformly approximated on K by polynomials. Hence 
(3) and (25) show that (2) can be satisfied. 

This completes the proof. //// 

One unusual feature of this proof should be pointed out. We had to prove 
that the given function / is in the closed subspace P(K) of C(K). (We use the 
terminology of Sec. 20.1.) Our first step consisted in approximating / by <I>. But 
this step took us outside P(K ), since <t> was so constructed that in general will 
not be holomorphic in the whole interior of K. Hence <I> is at some positive 
distance from P(K). However, (25) shows that this distance is less than a constant 
multiple of co(<5). [In fact, having proved the theorem, we know that this distance 
is at most co(<5), by (3), rather than 6,000 cu(3).] The proof of (25) depends on the 
inequality (4) and on the fact that 30 = 0 in G. Since holomorphic functions cp 
are characterized by dip = 0, (4) may be regarded as saying that O is not too far 
from being holomorphic, and this interpretation is confirmed by (25). 


Exercises 


1 Extend Mergelyan’s theorem to the case in which S 2 — K has finitely many components: Prove 
that then every / e C{K ) which is holomorphic in the interior of K can be uniformly approximated on 
K by rational functions. 

2 Show that the result of Exercise 1 does not extend to arbitrary compact sets K in the plane, by 
verifying the details of the following example. For n— 1, 2, 3, . . . , let D n — D( a„; r„) be disjoint open 
discs in U whose union V is dense in 17, such that Er„ < oo. Put K — U — V. Let T and y n be the 
paths 


and define 


r(t) = e l \ y n {t) = a „ + r„ 0 < t < In, 


Uf) = JV(z) dz - 


Z f/M* 


(/e C(K)). 


Prove that L is a bounded linear functional on C(K), prove that L(R) = 0 for every rational function 
R whose poles are outside K, and prove that there exists an / e C{K ) for which L(/) # 0. 

3 Show that the function g constructed in the proof of Lemma 20.2 has the smallest supremum norm 
among all / e H(Q) such that z/(z)— ► 1 as z— ► oo. (This motivates the proof of the lemma.) 

Show also that b = c 0 in that proof and that the inequality | b | < 4r can therefore be replaced by 
| b | < r. In fact, b lies in the convex hull of the set E. 



APPENDIX 

HAUSDORFFS MAXIMALITY THEOREM 


We shall first prove a lemma which, when combined with the axiom of choice, 
leads to an almost instantaneous proof of Theorem 4.21. 

If & is a collection of sets and <t> c & 9 we call <5> a subchain of & provided 
that <5> is totally ordered by set inclusion. Explicitly, this means that if A e $ and 
B e then either A c B or B c A. The union of all members of Q> will simply be 
referred to as the union of Q>. 

Lemma Suppose 3F is a nonempty collection of subsets of a set X such that the 
union of every subchain of 2F belongs to 3F . Suppose g is a function which 
associates to each A e 3F a set g(A) e 3F such that A a g(A) and g(A) — A 
consists of at most one element. Then there exists an A e 3F for which 
g(A) = A. 

Proof Fix A 0 e 3F . Call a subcollection of & a tower if has the fol- 
lowing three properties: 

(a) A 0 e^\ 

(b) The union of every subchain of $F’ belongs to 

(c) If A e then also g(A) e 

The family of all towers is nonempty. For if IF ± is the collection of all 
A e 3F such that A 0 cz A , then ^ is a tower. Let be the intersection of 
all towers. Then & 0 is a tower (the verification is trivial), but no proper 
subcollection of IF 0 is a tower. Also, A 0 a A if A e F 0 . The idea of the 
proof is to show that F 0 is a subchain of F . 
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Let T be the collection of all C e such that every A e & 0 satisfies 
either A cz C or C a A. 

For each CeT, let 0(C) be the collection of all A e & 0 such that either 
A cz C or g(C) cz A. 

Properties (a) and (b) are clearly satisfied by T and by each 0(C). Fix 
C e T, and suppose A e O(C). We want to prove that g(A) e O(C). If 
A e O(C), there are three possibilities: Either A cz C and A ^ C, or A = C, or 
g(C) cz A. If A is a proper subset of C, then C cannot be a proper subset of 
g(A\ otherwise g(A) — A would contain at least two elements; since C e T, it 
follows that g(A) cz C. If A = C, then g(A) = g(C). If g(C) cz A, then also 
g(C) cz g(A), since A cz g(A). Thus g(A) e 0(C), and we have proved that O(C) 
is a tower. The minimality of & 0 implies now that 0(C) = , for every 

CeT. 

In other words, if A e and CeT, then either A cz C or g(C) cz A. But 
this says that g(C) e T. Hence T is a tower, and the minimality of & 0 shows 
that T = . It follows now from the definition of T that is totally 

ordered. 

Let A be the union of o . Since o satisfies (b), A e . By (c), g(A) e 

. Since A is the largest member of 2F ^ and since A c= g(A), it follows that 
A = g(A). //// 

Definition A choice function for a set X is a function / which associates to 
each nonempty subset E of X an element of E:f(E) e E. 

In more informal terminology, /“ chooses ” an element out of each non- 
empty subset of X. 

The Axiom of Choice For every set there is a choice function. 

HausdorfFs Maximality Theorem Every nonempty partially ordered set P 
contains a maximal totally ordered subset. 

Proof Let & be the collection of all totally ordered subsets of P. Since every 
subset of P which consists of a single element is totally ordered, is not 
empty. Note that the union of any chain of totally ordered sets is totally 
ordered. 

Let / be a choice function for P. If A e let A* be the set of all x in the 
complement of A such that A u {x} e If A* ^ 0, put 

g(A) = A u {f (A*)}. 

If A* = 0, put g(A) = A. 

By the lemma, A* = 0 for at least one A e and any such A is a 
maximal element of <F. //// 



NOTES AND COMMENTS 


Chapter 1 

The first book on the modern theory of integration and differentiation is Lebesgue’s “Le9ons sur 
Integration, ” published in 1904. 

For an illuminating history of the earlier attempts that were made toward the construction of a 
satisfactory theory of integration, see [42]f, which contains interesting discussions of the difficulties 
that even first-rate mathematicians had with simple set-theoretic concepts before these were properly 
defined and well understood. 

The approach to abstract integration presented in the text is inspired by Saks [28]. Greater 
generality can be attained if <r-algebras are replaced by <r-rings ( Axioms : (J A ( e 0t and A x — A 2 e 
if A t e 0t for i = 1, 2, 3, . it is not required that X e but at the expense of a necessarily fussier 
definition of measurability. See Sec. 18 of [7]. In all classical applications the measurability of X is 
more or less automatic. This is the reason for our choice of the somewhat simpler theory based on 
^-algebras. 

Sec. 1.11. This definition of 06 is as in [12]. In [7] the Borel sets are defined as the c-ring 
generated by the compact sets. In spaces which are not ^-compact, this is a smaller family than ours. 

Chapter 2 

Sec. 2.12. The usual statement of Urysohn’s lemma is: If K 0 and K x are disjoint closed sets in a 
normal Hausdorff space X , then there is a continuous function on X which is 0 on K 0 and 1 on ^ 
The proof is exactly as in the text. 

Sec. 2.14. The original form of this theorem, with X = [0, 1], is due to F. Riesz (1909). See [5], 
pp. 373, 380-381, and [12], pp. 134-135 for its further history. The theorem is here presented in the 
same generality as in [12]. The set function fi which is defined for all subsets of X in the proof of 
Theorem 2.14 is called an outer measure because of its countable subadditivity (Step I). For systematic 
exploitations (originated by Caratheodory) of this notion, see [25] and [28]. 

Sec. 2.20. For direct constructions of Lebesgue measure, along more classical lines, see [31], 
[35], [26] and [53]. 

t Numbers in brackets refer to the Bibliography. 
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Sec. 2.21. A proof that the cardinality of every countably generated <r-algebra is <,c may be 
found on pp. 133-134 of [44]. That this cardinality is either finite or >c should be clear after doing 
Exercise 1 of Chap. 1. 

Sec. 2.22. A very instructive paper on the subject of nonmeasurable sets in relation to measures 
invariant under a group is: J. von Neumann, Zur allgemeinen Theorie des Masses, Fundamenta 
Math., vol. 13, pp. 73-116, 1929. See also Halmos’s article in the special (May, 1958) issue of Bull Am. 
Math. Soc. devoted to von Neumann’s work. 

Sec. 2.24. [28], p. 72. 

Sec. 2.25. [28], p. 75. There is another approach to the Lebesgue theory of integration, due to 
Daniell (Ann. Math., vol. 19, pp. 279-29 4, 1917-1918) based on extensions of positive linear function- 
als. When applied to C c (X ) it leads to a construction which virtually turns the Vitali-Caratheodory 
theorem into the definition of measurability. See [17] and, for the full treatment, [18]. 

Exercise 8. Two interesting extensions of this phenomenon have appeared in Amer. Math. 
Monthly : see vol. 79, pp. 884-886, 1972 (R. B. Kirk) and vol. 91, pp. 564-566, 1984 (F. S. Cater). 

Exercise 17. This example appears in A Theory of Radon Measures on Locally Compact 
Spaces, by R. E. Edwards, Acta Math., vol. 89, p. 160, 1953. Its existence was unfortunately over- 
looked in [27]. 

Exercise 18. [7], p. 231 ; originally due to Dieudonne. 

Chapter 3 

The best general reference is [9]. See also Chap. 1 of [36]. 

Exercise 3. Vol. 1 (1920) of Fundamenta Math, contains three papers relevant to the parentheti- 
cal remark. 

Exercise 7. A very complete answer to this question was found by A. Villani, in Am. Math. 
Monthly , vol. 92, pp. 485-487, 1985. 

Exercise 16. [28], p. 18. When t ranges over all positive real numbers, a measurability problem 
arises in the suggested proof. This is the reason for including (ii) as a hypothesis. See W. Walter, Am. 
Math. Monthly, vol. 84, pp. 118-119, 1977. 

Exercise 17. The second suggested proof of part ( b ) was published by W. P. Novinger, in Proc. 
Am. Math. Soc., vol. 34, pp. 627-628, 1972. It was also discovered by David Hall. 

Exercise 18. Convergence in measure is a natural concept in probability theory. See [7], 
Chap. IX. 

Chapter 4 

There are many books dealing with Hilbert space theory. We cite [6] and [24] as main references. See 
also [5], [17], and [19]. 

The standard work on Fourier series is [36]. For simpler introductions, see [10], [31], [43], and 

[45]. 

Exercise 2. This is the so-called Gram-Schmidt orthogonalization process. 

Exercise 18. The functions that are uniform limits (on R 1 ) of members of X are called almost 
periodic. 

Chapter 5 

The classical work here is [2]. More recent treatises are [5], [14], and [24]. See also [17] and [49]. 

Sec. 5.7. The relations between measure theory on the one hand and Baire’s theorem on the 
other are discussed in great detail in [48]. 

Sec. 5.11. Although there are continuous functions whose Fourier series diverge on a dense G s , 
the set of divergence must have measure 0. This was proved by L. Carleson (Acta Math., vol. 116, pp. 
135-157, 1966) for all /in L 2 (T); the proof was extended to Z?(T), p > 1, by R. A. Hunt. See also [45], 
especially Chap. II. 

Sec. 5.22. For a deeper discussion of representing measures see Arens and Singer, Proc. Am. 
Math. Soc., vol. 5, pp. 735-745, 1954. Also [39], [52]. 



NOTES AND COMMENTS 399 


Chapter 6 

Sec. 6.3. The constant 1 /n is best possible. See R. P. Kaufman and N. W. Rickert, Bull . Am. Math. 
Soc., vol. 72, pp. 672-676, 1966, and (for a simpler treatment) W. W. Bledsoe, Am. Math. Monthly , vol. 
77, pp. 180-182, 1970. 

Sec. 6.10. von Neumann’s proof is in a section on measure theory in his paper: On Rings of 
Operators, III, Ann. Math., vol. 41, pp. 94-161, 1940. See pp. 124-130. 

Sec. 6.15. The phenomenon L 00 # (L 1 )* is discussed by J. T. Schwartz in Proc. Am. Math. Soc., 
vol. 2, pp. 270-275, 1951, and by H. W. Ellis and D. O. Snow in Can. Math. Bull., vol. 6, pp. 211-229, 
1963. See also [7], p. 131, and [28], p. 36. 

Sec. 6.19. The references to Theorem 2.14 apply here as well. 

Exercise 6. See [17], p. 43. 

Exercise 10(0). See [36], vol. I, p. 167. 


Chapter 7 

Sec. 7.3. This simple covering lemma seems to appear for the first time in a paper by Wiener on the 
ergodic theorem (Duke Math. J., vol. 5, pp. 1-18, 1939). Covering lemmas play a central role in the 
theory of differentiation. See [50], [53], and, for a very detailed treatment, [41]. 

Sec. 7.4. Maximal functions were first introduced by Hardy and Littlewood, in Acta Math., vol. 
54, pp. 81-116, 1930. That paper contains proofs of Theorems 8.18, 11.25(6), and 17.11. 

Sec. 7.21. The same conclusion can be obtained under somewhat weaker hypotheses; see [16], 
Theorems 260-264. Note that the proof of Theorem 7.21 uses the existence and integrability of only 
the right-hand derivative of /, plus the continuity of /. For a further refinement, see P. L. Walker, 
Amer. Math. Monthly, vol. 84, pp. 287-288, 1977. 

Secs. 7.25, 7.26. This treatment of the change of variables formula is quite similar to D. 
Varberg’s in Amer. Math. Monthly, vol. 78, pp. 42-45, 1971. 

Exercise 5. A very simple proof, due to K. Stromberg, is in Proc. Amer. Math. Soc., vol. 36, 
p. 308, 1972. 

Exercise 12. For an elementary proof that every monotone function (hence every function of 
bounded variation) is differentiable a.e., see [24], pp. 5-9. In that work, this theorem is made the 
starting point of the Lebesgue theory. Another, even simpler, proof by D. Austin is in Proc. Amer. 
Math. Soc., vol. 16, pp. 220-221, 1965. 

Exercise 18. These functions q> n are the so-called Rademacher functions. Chap. V of [36] con- 
tains further theorems about them. 


Chapter 8 

Fubini’s theorem is developed here as in [7] and [28]. For a different approach, see [25]. Sec. 8.9(c) is 
in Fundament a Math., vol. 1, p. 145, 1920. 

Sec. 8.18. This proof of the Hardy-Littlewood theorem (see the reference to Sec. 7.4) is essen- 
tially that of a very special case of the Marcinkiewicz interpolation theorem. A full discussion of the 
latter may be found in Chap. XII of [36]. See also [50]. 

Exercise 2. Corresponding to the idea that an integral is an area under a curve, the theory of the 
Lebesgue integral can be developed in terms of measures of ordinate sets. This is done in [16]. 

Exercise 8. Part ( b ), in even more precise form, was proved by Lebesgue in J. Mathematiques, 
ser. 6, vol. 1, p. 201, 1905, and seems to have been forgotten. It is quite remarkable in view of another 
example of Sierpinski ( Fundamenta Math., vol. 1, p. 114, 1920): There is a plane set E which is not 
Lebesgue measurable and which has at most two points on each straight line. If/ = Xe > then / not 
Lebesgue measurable, although all of the sections f x and f y are upper semicontinuous; in fact, each 
has at most two points of discontinuity. (This example depends on the axiom of choice, but not on the 
continuum hypothesis.) 
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Chapter 9 

For another brief introduction, see [36], chap. XVI. A different proof of Plancherel’s theorem is in 
[33]. Group-theoretic aspects and connections with Banach algebras are discussed in [17], [19], and 
[27]. For more on invariant subspaces (Sec. 9.16) see [11]; the corresponding problem in L 1 is 
described in [27], Chap. 7. 


Chapter 10 

General references: [1], [4], [13], [20], [29], [31] and [37]. 

Sec. 10.8. Integration can also be defined over arbitrary rectifiable curves. See [13], vol. I, 
Appendix C. 

Sec. 10.10. The topological concept of index is applied in [29] and is fully utilized in [1]. The 
computational proof of Theorem 10.10. is as in [1], p. 93. 

Sec. 10.13. Cauchy published his theorem in 1825, under the additional assumption that /' is 
continuous. Goursat showed that this assumption is redundant, and stated the theorem in its present 
form. See [13], p. 163, for further historical remarks. 

Sec. 10.16. The standard proofs of the power series representation and of the fact that / e H(Q) 
implies /' e H(Q) proceed via the Cauchy integral formula, as here. Recently proofs have been con- 
structed which use the winding number but make no appeal to integration. For details see [34]. 

Sec. 10.25. A very elementary proof of the algebraic completeness of the complex field is in [26], 
p. 170. 

Sec. 10.30. The proof of part ( b ) is as in [47]. 

Sec. 10.32. The open mapping theorem and the discreteness of Z(f) are topological properties of 
the class of all nonconstant holomorphic functions which characterize this class up to homeomorp- 
hisms. This is Stoilov’s theorem. See [34]. 

Sec. 10.35. This strikingly simple and elementary proof of the global version of Cauchy’s 
theorem was discovered by John D. Dixon, Proc. Am. Math. Soc ., vol. 29, pp. 625-626, 1971. In [1] 
the proof is based on the theory of exact differentials. In the first edition of the present book it was 
deduced from Runge’s theorem. That approach was used earlier in [29], p. 177. There, however, it 
was applied in simply connected regions only. 


Chapter 1 1 

General references: [1], Chap. 5; [20], Chap. 1. 

Sec. 11.14. The reflection principle was used by H. A. Schwarz to solve problems concerning 
conformal mappings of polygonal regions. See Sec. 17.6 of [13]. Further results along these lines were 
obtained by Caratheodory; see [4], vol. II, pp. 88-92, and Commentarii Mathematici Helvetia , vol. 19, 
pp. 263-278, 1946-1947. 

Secs. 11.20, 11.25. This is the principal result of the Hardy-Littlewood paper mentioned in the 
reference to Sec. 7.4. The proof of the second inequality in Theorem 11.20 is as in [40], p. 23. 

Sec. 11.23. The first theorems of this type are in Fatou’s thesis, Series trigonometriques et series 
de Taylor, Acta Math., vol. 30, pp. 335-400, 1906. This is the first major work in which Lebesgue’s 
theory of integration is applied to the study of holomorphic functions. 

Sec. 11.30. Part (c) is due to Herglotz, Leipziger Berichte , vol. 63, pp. 501-511, 1911. 

Exercise 14. This was suggested by W. Ramey and D. Ullrich. 


Chapter 12 

Sec. 12.7. For further examples, see [31], pp. 176-186. 

Sec. 12.11. This theorem was first proved for trigonometric series by W. H. Young (1912; q = 2, 
4, 6, ...) and F. Hausdorff (1923; 2 < q < oo). F. Riesz (1923) extended it to uniformly bounded 
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orthonormal sets, M. Riesz (1926) derived this extension froih a general interpolation theorem, and G. 
O. Thorin (1939) discovered the complex-variable proof of M. Riesz’s theorem. The proof of the text is 
the Calderon-Zygmund adaptation (1950) of Thorin’s idea. Full references and discussions of other 
interpolation theorems are in Chap. XII of [36]. 

Sec. 12.13. In slightly different form, this is in Duke Math. J., vol. 20, pp. 449-458, 1953. 

Sec. 12.14. This proof is essentially that of R. Kaufman (Math. Ann., vol. 169, p. 282, 1967). E. L. 
Stout (Math. Ann., vol. 177, pp. 339-340, 1968) obtained a stronger result. 

Chapter 13 

Sec. 13.9. Runge’s theorem was published in Acta Math., vol. 6, 1885. (Incidentally, this is the same 
year in which the Weierstrass theorem on uniform approximation by polynomials on an interval was 
published; see Mathematische Werke, vol. 3, pp. 1-37.) See [29], pp. 171-177, for a proof which is 
close to the original one. The functional analysis proof of the text is known to many analysts and has 
probably been independently discovered several times in recent years. It was communicated to me by 
L. A. Rubel. In [13], vol. II, pp. 299-308, attention is paid to the closeness of the approximation if the 
degree of the polynomial is fixed. 

Exercises 5, 6. For yet another method, see D. G. Cantor, Proc. Am. Math. Soc., vol. 15, pp. 
335-336, 1964. 

Chapter 14 

General reference: [20]. Many special mapping functions are described there in great detail. 

Sec. 14.3. More details on linear fractional transformations may be found in [1], pp. 22-35; in 
[13], pp. 46-57; in [4]; and especially in Chap. 1 of L. R. Ford’s book “Automorphic Functions,” 
McGraw-Hill Book Company, New York, 1929. 

Sec. 14.5. Normal families were introduced by Montel. See Chap. 15 of [13]. 

Sec. 14.8. The history of Riemann’s theorem is discussed in [13], pp. 320-321, and in [29], 
p. 230. Koebe’s proof (Exercise 26) is in J. fur reine und angew. Math., vol. 145, pp. 177-223, 1915; 
doubly connected regions are also considered there. 

Sec. 14.14. Much more is true than just | a 2 \ < 2: In fact, | a n \ < n for all n > 2. This was conjec- 
tured by Bieberbach in 1916, and proved by L. de Branges in 1984 \_Acta Math., vol. 154, pp. 137-152, 
(1985)]. Moreover, if | a n \ = n for just one n > 2, then / is one of the Koebe functions of Example 
14.11. 

Sec. 14.19. The boundary behavior of conformal mappings was investigated by Caratheodory in 
Math. Ann., vol. 73, pp. 323-370, 1913. Theorem 14.19 was proved there for regions bounded by 
Jordan curves, and the notion of prime ends was introduced. See also [4], vol. II, pp. 88-107. 

Exercise 24. This proof is due to Y. N. Moschovakis. 

Chapter 15 

Sec. 15.9. The relation between canonical products and entire functions of finite order is discussed in 
Chap. 2 of [3], Chap. VII of [29], and Chap. VIII of [31]. 

Sec. 15.25. See Szasz, Math. Ann., vol. 77, pp. 482-496, 1916, for further results in this direction. 
Also Chap. II of [21]. 

Chapter 16 

The classical work on Riemann surfaces is [32]. (The first edition appeared in 1913.) Other references: 
Chap. VI of [1], Chap. 10 of [13], Chap. VI of [29], and [30]. 

Sec. 16.5. Ostrowski’s theorem is in J. London Math. Soc., vol. 1, pp. 251-263, 1926. See J.-P. 
Kahane, Lacunary Taylor and Fourier Series, Bull. Am. Math. Soc., vol. 70, pp. 199-213, 1964, for a 
more recent account of gap series. 

Sec. 16.15. The approach to the monodromy theorem was a little simpler in the first edition of 
this book. It used the fact that every simply connected plane region is homeomorphic to a convex 
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one, namely U. The present proof is so arranged that it applies without change to holomorphic 
functions of several complex variables. (Note that when k > 2 there exist simply connected open sets 
in R k which are not homeomorphic to any convex set; spherical shells furnish examples of this.) 

Sec. 16.17. Chap. 13 of [13], Chap. VIII of [29], and part 7 of [4]. 

Sec. 16.21. Picard’s big theorem is proved with the aid of modular functions in part 7 of [4]. 
“Elementary” proofs may be found in [31], pp. 277-284, and in Chap. VII of [29]. 

Exercise 10. Various classes of removable sets are discussed by Ahlfors and Beurling in Confor- 
mal Invariants and Function-theoretic Null-Sets, Acta Math., vol. 83, pp. 101-129, 1950. 


Chapter 17 

The classical reference here is [15]. See also [36], Chap. VII. Although [15] deals mainly with the 
unit disc, most proofs are so constructed that they apply to other situations which are described there. 
Some of these generalizations are in Chap. 8 of [27]. Other recent books on these topics are [38], 
[40], and [46]. 

Sec. 17.1. See [22] for a thorough treatment of subharmonic functions. 

Sec. 17.13. For a different proof, see [15], or the paper by Helson and Lowdenslager in Acta 
Math., vol. 99, pp. 165-202, 1958. An extremely simple proof was found by B. K. 0ksendal, in Proc. 
Amer. Math. Soc ., vol. 30, p. 204, 1971. 

Sec. 17.14. The terms “inner function” and “outer function” were coined by Beurling in the 
paper in which Theorem 17.21 was proved: On Two Problems Concerning Linear Transformations in 
Hilbert Space, Acta Math., vol. 81, pp. 239-255, 1949. For further developments, see [11]. 

Secs. 17.25, 17.26. This proof of M. Riesz’s theorem is due to A. P. Calderon. See Proc. Am. 
Math. Soc., vol. 1, pp. 533-535, 1950. See also [36], vol. I, pp. 252-262. 

Exercise 3. This forms the basis of a definition of H p -spaces in other regions. See Trans. Am. 
Math. Soc., vol. 78, pp. 46-66, 1955. 


Chapter 18 

General references: [17], [19], and [23]; also [14]. The theory was originated by Gelfand in 1941. 

Sec. 18.18. This was proved in elementary fashion by P. J. Cohen in Proc. Am. Math. Soc., vol. 
12, pp. 159-163, 1961. 

Sec. 18.20. This theorem is Wermer’s, Proc. Am. Math. Soc., vol. 4, pp. 866-869, 1953. The proof 
of the text is due to Hoffman and Singer. See [15], pp. 93-94, where an extremely short proof by P. J. 
Cohen is also given. (See the reference to Sec. 18.18.) 

Sec. 18.21. This was one of the major steps in Wiener’s original proof of his Tauberian theorem. 
See [33], p. 91. The painless proof given in the text was the first spectacular success of the Gelfand 
theory. 

Exercise 14. The set A can be given a compact Hausdorff topology with respect to which the 
function x is continuous. Thus x -* x is a homomorphism of A into C(A). This representation of A as 
an algebra of continuous functions is a most important tool in the study of commutative Banach 
algebras. 


Chapter 19 

Secs. 19.2, 19.3: [21], pp. 1-13. See also [3], where functions of exponential type are the main subject. 

Sec. 19.5. For a more detailed introduction to the classes C{M n }, see S. Mandelbrojt, “Series de 
Fourier et classes quasi-analytiques,” Gauthier- Villars, Paris, 1935. 

Sec. 19.11. In [21], the proof of this theorem is based on Theorem 19.2 rather than on 19.3. 
Exercise 4. The function <I> is called the Borel transform of /. See [3], Chap. 5. 

Exercise 12. The suggested proof is due to H. Mirkil, Proc. Am. Math. Soc., vol. 7, pp. 650-652, 
1956. The theorem was proved by Borel in 1895. 
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Chapter 20 

See S. N. Mergelyan, Uniform Approximations to Functions of a Complex Variable, Uspehi Mat. 
Nauk (N.S.) 7, no. 2 (48), 31-122, 1952; Amer. Math. Soc. Translation No. 101, 1954. Our Theorem 
20.5 is Theorem 1.4 in Mergelyan’s paper. 

A functional analysis proof, based on measure-theoretic considerations, has recently been 
published by L. Carleson in Math. Scandinavica , vol. 15, pp. 167-175, 1964. 

Appendix 

The maximality theorem was first stated by Hausdorff on p. 140 of his book “Grundziige der 
Mengenlehre,” 1914. The proof of the text is patterned after Sec. 16 of Halmos’s book [8]. The 
idea to choose g so that g{A) — A has at most one element appears there, as does the term “ tower.” 
The proof is similar to one of Zermelo’s proofs of the well-ordering theorem; see Math. Ann., vol. 65, 
pp. 107-128, 1908. 
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LIST OF SPECIAL SYMBOLS AND 

ABBREVIATIONSf 


exp ( 2 ) 

1 

T 

8 

m 

8 

Xe 

11 

lim sup 

14 

lim inf 

14 

f\r 

15 


24 

a.e. 

27 

E 

35 

C c (X) 

38 

K<f<V 

39 


42 

m, m k 

51 

A (T) 

51, 150 

L l (R k ), L\E) 

53 

\\f\\ P , \\f\L 

65, 66 

U(n), L p (R k ), £ p 

65 

L°°(/x), E°(R k ), r j 

66 

C 0 (X), C(X) 

70 

(x, y) 

76 

11*11 

76 

x J. y, M 1 

79 


x(a) 

82 

T 

88 

Zf(T), C(T) 

88 

Z 

89 

m 

91 

Co 

104 

ll/ll* 

108 

U 

110 

P r (0 - t) 

111 

Lip a 

113 

l^l(£) 

116 

/A 

119 

A < H 

120 

Aj 1 A 2 

120 

B(x, r) 

136 

QrR 

136 

Du 

136, 241 

Mu 

136, 241 

Mf 

138 

AC 

145 

T'(x) 

150 

J y 

150 

BV 

157 


t The standard set-theoretic symbols are described on pages 6 to 8 and are not listed here. 
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E x , E> 

161 

PM 

240 


162 

a* 

240 

fi X X 

164 

N. 

241 

f *g 

171 

rad 

241 

n * A 

175 

<7 

241 

fit) 

178 

i/°° 

248 

C 00 

194 

/V') 

249 

c* 

194 

<P&) 

254 

D(a ; r), D'(a; r), £)(a; r) 

196 

S 2 

266 

il 

197 

Sf 

285 

H(«) 

197 

E p (z) 

301 

7 , y* 

200 

log + f 

311 

dA 

202 

N 

311 

Ind y (z) 

203 

(/o.J>o)~(/i. *>l) 

323 

Z(f) 

208 

H” 

338 

+ 

217 

M„Q, 

344 

I 2 = I x I 

222 

ff(x) 

357 

d, 8 

231 

p(x) 

360 

A 

232 

C{M n } 

377 

P[/] 

233 

P(K) 

386 

n + , n- 

237 

C c {R 2 ) 

388 



INDEX 


Absolute continuity, 120 
of functions, 145 
Absolute convergence, 116 
Absolutely convergent Fourier series, 367 
Addition formula, 1 
Affine transformation, 377 
Ahlfors, L. V., 402 
Algebra, 356 
of measures, 175 
of sets, 10 

Algebraically closed field, 213 

Almost everywhere, 27 

Almost periodic function, 94 

Almost uniform convergence, 214 

Analytic continuation, 323, 377, 379 

Analytic function, 197 

Annulus, 229, 264, 292 

Approach region, 240 

Area, 250 

Area theorem, 286 

Arens, R., 398 

Argument, 204 

Arithmetic mean, 63 

Arzela-Ascoli theorem, 245 

Associative law, 18 

Asymptotic value, 265 

Austin, D., 399 

Average, 30 

Axiom of choice, 396 

Baire’s theorem, 97 
Balanced collection, 268 


Ball, 9 

Banach, S„ 105 

Banach algebra, 190, 356 

Banach space, 95 

Banach-Steinhaus theorem, 98 

Bessel’s inequality, 85, 260 

Beurling, A., 335, 402 

Beurling’s theorem, 348 

Bieberbach conjecture, 401 

Blaschke product, 310, 317, 318, 338, 353 

Bledsoe, W. W., 399 

Borel, E., 5, 402 

Borel function, 12 

Borel measure, 47 

Borel set, 12 

Borel transform, 402 

Boundary, 108, 202, 289 

Bounded function, 66 

Bounded linear functional, 96, 113, 127, 130 
Bounded linear transformation, 96 
Bounded variation, 117, 148, 157 
Box, 50 

Brouwer’s fixed point theorem, 151 

Calderon, A. R, 401, 402 
Cancellation law, 19 
Canonical product, 302 
Cantor, D. G., 401 
Cantor set, 58, 145 
Caratheodory, C., 397, 400, 401 
Carleson, L., 398 
Carrier, 58 
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Cartesian product, 7, 160 
Category theorem, 98 
Cater, F. S., 398 
Cauchy, A., 400 

Cauchy formula, 207, 219, 229, 268, 341, 384, 
388 

Cauchy-Riemann equations, 232 

Cauchy sequence, 67 

Cauchy theorem, 205, 206, 207, 218 

Cauchy’s estimates, 213 

Cell, 50 

Chain, 218 

Chain rule, 197 

Change of variables, 153, 156 

Character, 179 

Characteristic function, 1 1 

Choice function, 396 

Class, 6 

Closed curve, 200 
Closed graph theorem, 114 
Closed path, 201 
Closed set, 13, 35 
Closed subspace, 78 
Closure, 35 
Cohen, P. J., 402 
Collection, 6 

Commutative algebra, 356 
Commutative law, 18 
Compact set, 35 
Complement, 7 
Complete measure, 28 
Complete metric space, 67 
Completion : 

of measure space, 28, 168 
of metric space, 70, 74 
Complex algebra, 356 
Complex field, 213, 359 
Complex homomorphism, 191, 362 
Complex-linear functional, 105 
Complex measure, 16, 116 
Complex vector space, 33 
Component, 197 
Composite function, 7 
Concentrated, 119 
Conformal equivalence, 282 
Conjugate exponents, 63 
Conjugate function, 350 
Connected set, 196 
Continuity, 8, 9 
Continuous function, 8 
Continuous linear functional, 81, 96, 113, 127, 
130 

Continuous measure, 175 
Continuum hypothesis, 167 


Convergence: 
dominated, 26 
in L p , 67 
in measure, 74 
monotone, 21 
almost uniform, 214 
uniform on compact subsets, 214 
weak, 245 

Convex function, 61 
Convex sequence, 410 
Convex set, 79 
Convexity theorem, 257 
Convolution, 170, 175, 178 
Coset, 362 
Cosine, 2, 265 
Countable additivity, 6, 16 
Counting measure, 17 
Cover, 35, 324 
Covering lemma, 137, 399 
Curve, 200 

with orthogonal increments, 94 
Cycle, 218 

Daniell, P. J., 398 
de Branges, L., 401 
Denjoy, A., 144 

Denjoy-Carleman theorem, 380 
Dense set, 58 
Derivative, 135, 197 
of Fourier transform, 179 
of function of bounded variation, 157 
of integral, 141 

of measure, 136, 142, 143, 241 
symmetric, 136 
of transformation, 150 
Determinant, 54 
Diagonal process, 246 
Dieudonne, J., 398 
Differentiable transformation, 150 
Differential, 150 
Direct continuation, 323 
Dirichlet kernel, 101 
Dirichlet problem, 235 
Disc, 9, 196 
Discrete measure, 175 
Disjoint sets, 7 
Distance function, 9 
Distribution function, 172 
Distributive law, 18 
Division algebra, 360, 

Dixon, J. D., 400 
Domain, 7 

Dominated convergence theorem, 26, 29, 
180 
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Double integral, 165 
Dual space, 108, 112, 127, 130 

Eberlein, W. F., 58 
Edwards, R. E., 398 
Egoroff’s theorem, 73 
Elementary factor, 301 
Elementary set, 161 
Ellipse, 287 
Ellis, H. W., 399 
Empty set, 6 
End point, 200 
Entire function, 198 
Equicontinuous family, 245 
Equivalence classes, 67 
Equivalent paths, 201 
Essential range, 74 
Essential singularity, 211, 267 
Essential supremum, 66 
Euclidean space, 34, 49 
Euclid’s inequality, 77 
Euler’s identity, 2 
Exponential function, 1, 198 
Exponential type, 372, 382, 383 
Extended real line, 7, 9 
Extension, 105 
Extension theorem, 105 
Extremal function, 255 
Extreme point, 251 

F a - set, 12 

Factorization, 303, 338, 344 
Family, 6 
Fatou, P., 400 

Fatou’s lemma, 23, 68, 309, 344 
Fatou’s theorem, 249 
Fejer kernel, 252 
Fejer’s theorem, 91 
Field, 394 

Finite additivity, 17 
First category, 98 

Fixed point, 151, 229, 247, 293, 297, 314, 318, 395 
Ford, L. R., 401 
Fourier coefficients, 82, 91 
of measure, 133 
Fourier series, 83, 91 
Fourier transform, 178 
Fubini’s theorem, 164, 168 
Function, 7 

absolutely continuous, 145 
analytic, 197 
Borel, 13 
bounded, 66 

of bounded variation, 148 


complex, 8 
continuous, 8 
convex, 61 
entire, 198 

essentially bounded, 66 
exponential, 1 

of exponential type, 372, 383 
harmonic, 232 
holomorphic, 197 
Lebesgue integrable, 24 
left-continuous, 157 
locally integrable, 194 
lower semicontinuous, 37 
measurable, 8, 29 
meromorphic, 224 
modular, 328 

nowhere differentiable, 1 14 
rational, 267 
real, 8 
simple, 15 
subharmonic, 335 
summable, 24 
upper semicontinuous, 37 
Function element, 323 
Functional: 
bounded, 96 
on C 0 , 130 
complex-linear, 105 
continuous, 96 
on Hilbert space, 81 
on L p , 127 
linear, 33 

multiplicative, 191, 364 
positive, 34 
real-linear, 105 
Functional analysis, 108 
Fundamental domain, 329 
Fundamental theorem of calculus, 144, 148 

G a -set, 12 

Gap series, 276, 321, 334, 354, 385 
Gelfand, I., 402 
Gelfand-Mazur theorem, 359 
Gelfand transform, 370 
Geometric mean, 63 
Goursat, E., 400 
Graph, 114, 174 
Greatest lower bound, 7 
Green’s theorem, 389 

Hadamard, J., 264, 321 

Hahn-Banach theorem, 104, 107, 270, 313, 359 
Hahn decomposition, 126 
Hall, D., 398 
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Halmos, P. R., 398, 403 
Hardy, G. H., 173, 335, 399 
Hardy’s inequality, 72, 177 
Harmonic conjugate, 350 
Harmonic function, 232 
Harmonic majorant, 352 
Harnack’s theorem, 236, 250 
Hausdorff, F., 12, 400 

Hausdorff maximality theorem, 87, 107, 195, 362, 
396 

Hausdorff separation axiom, 36 
Hausdorff space, 36 
Hausdorff- Young theorem, 261 
Heine-Borel theorem, 36 
Helson, H., 348 
Herglotz, G., 400 
Hilbert cube, 92 
Hilbert space, 77 
isomorphism, 86 
Hoffman, K., 402 
Holder’s inequality, 63, 66 
Holomorphic function, 197 
Homomorphism, 179, 191, 364, 402 
Homotopy, 222 
Hunt, R. A., 398 

Ideal, 175, 305, 362 
Image, 7 

Independent set, 82 
Index: 

of curve, 223, 230 
of cycle, 218 
of path, 203 
Infimum, 7 
Infinite product, 298 
Initial point, 200 
Inner factor, 344 
Inner function, 342 
Inner product, 76, 89 
Inner regular set, 47 
Integral, 19, 24 
Integration, 19 
over cycle, 218 
of derivative, 146, 148, 149 
over measurable set, 20 
by parts, 157 
over path, 200 

with respect to complex measure, 129 
Interior, 267 

Interpolation, 173, 260, 304 
Intersection, 6 
Interval, 7 

Invariant subspace, 188, 346 
Inverse image, 7 


Inverse mapping, 7, 215 
Inversion, 280 
Inversion formula, 181 
Inversion theorem, 185, 186 
Invertible element, 357 
Isolated singularity, 210, 266 
Isometry, 84 
Isomorphism, 86 
Iterated integral, 165 

Jacobian, 150, 250 
Jensen’s formula, 307 
Jensen’s inequality, 62 
Jordan, C., 5 
Jordan curve, 291 
Jordan decomposition, 119 

Kahane, J. P., 401 
Kaufman, R. P., 399, 437 
Kirk, R. B., 398 
Koebe function, 285, 401 

Laplace equation, 232 
Laplacian, 195 
Laurent series, 230 
Least upper bound, 7 
Lebesgue, H. J., 5, 21, 397, 399 
Lebesgue decomposition, 121 
Lebesgue integrable function, 24 
Lebesgue integral, 19 
Lebesgue measurable set, 51 
Lebesgue measure, 51 
Lebesgue point, 138, 159, 241 
Left-continuous function, 157 
Left-hand limit, 157 
Length, 159, 202 
Limit, pointwise, 14 
in mean, 67 

of measurable functions, 14 
in measure, 74 
Lindelof’s theorem, 259 
Linear combination, 82 
Linear fractional transformation, 279, 296 
Linear independence, 82 
Linear transformation, 33 
Linearly ordered set, 87 
Liou ville’s theorem, 212, 220, 359 
Lipschitz condition, 113 
Littlewood, J. E., 173, 399 
Locally compact space, 36 
Locally integrable function, 194 
Logarithm, 227, 274 
Lowdenslager, D., 348 
Lower half plane, 237 
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Lower limit, 14 

Lower semicontinuous function, 37 
Lusin’s theorem, 55 

Mandelbrojt, S., 402 
Mapping, 7 
continuous, 8 
one-to-one, 7 
open, 99, 214 
(See also Function) 

Marcinkiewicz, J., 173 
Maximal function, 136, 138, 241 
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